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designing effective man-machine interfaces seems to be the 
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This thesis examines the principles of design of effec- 
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in achieving effective man=machine communication and the 
implementation issues and techniques for their use as 
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I. BACKGROUND 


Until recently it was required that the computer user 
should be a specialist with an appropriate education for 
handling the computer without causing serious problems. The 
requirements of the contemporary enterprise for as many 
users aS possible and the advance of the technology have 
brought many people in front of the computer not only for 
professional but also for private use. The problem todayame 
how these people would be able to take as much information 
as possible out of the data available in the computer's 
storage devices. 

Iwo major aspects have to be considered in the design of 
Man-machine communication; the characteristics of the user 
and the means of interaction between him and the computer. 
The study of user characteristics helps the designer 
establish objectives for the man-machine dialogue and the 
principles which govern the dialogue at the interface. Not 
all means of interaction are appropriate for all cases. For 
example, programming languages are inappropriate for users 
who have not learned programming and who probably would not 
be eager to do this, for example, administration staff. 
Natural languages are inappropriate where precise input 
interpretation 1s required. 

The cost factor is the most serious factor for the 
selection of an interface which takes into account the user 
and his needs. Another important factor is the variety in 
the features of the users which may result in the fact that 
a specific type of dialogue is appropriate for some of them 
but it is not appropriate for others. 

A man-machine interface is the medium of interaction 


between man and machine. For example as the steering wheel 
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of aocar is an interface where the driver andthe car 
interact, the terminal is an interface between the user and 
the computer . The interface may also be something abstract, 
like a language, which is the medium for transferring user 
ideas to the computer, which are then translated by the 
computer into appropriate actions. 

Given that a user's statement may have more’ than one 
meaning with respect to various sets of actions, it is 
possible that each set of actions will require its own 
interface. 

The requirements of the contemporary enterprise for 
users who are not specialists or experts in interacting with 
the computer, ask for interfaces which are user-friendly and 
are addressed to the average user. The design of these 
interfaces must take into account the needs of the average 
user as a human being and the user's characteristics and 
behavior when he faces the computer. 

This thesis concerns the category of the so called 
"linguistic" interfaces which includes the languages which 
are used as interfaces between man and computer. Natural 
Language seems at first look to meet the objectives for the 
design and implementation of a "humanized" interface which 
addresses the needs of the average user. 

A large number of natural language interfaces have been 
developed today. Yet, many practical questions remain 
unsolved. There are many arguments about the best 
architecture of natural language interfaces or whether these 
interfaces are preferable to the artificial language inter- 
faces designed with human factors in mind. Besides the 
problem for evaluating the best alternative there are some 
other questions which ask for an answer: 

e Is natural language implementation feasible ? 

° If natural language interfaces can successfully be 


implemented, do they solve the problem ? 
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* How transportable are they to new applications ? 
The purpose of the thesis is to search the design issues and 
implementation techniques of Natural Languages as man- 
machine interfaces and to present a possible design of an 
acceptable Natural Language Interface. 

In the first chapter of the thesis a comparison of 
nmatural languages to the existing programming and data 
retrieval language is made. The next two chapters examine 
the objectives and design principles of man-machine 
interfaces in general and provide suggestions about the 
requirements which man-emachine interfaces must satisfy. The 
next chapters concern the implementation issues and tech- 
niques for the natural languages, their practicality and 
suggest adesign for a general purpose portable Natural 
Language Interface. In the last chapter, the possibilities 
for promoting natural language applications in the military 


environment are examined. 
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II. NATURAL LANGUAGES VS COMPUTER LANGUAGES 

A computer performs its various functions by means of a 
program of instructions which is the target program. The 
various languages are the means by which people can communi- 
cate their ideas to the target program to make this program 
do what they desire. A person must generate a statement ina 
certain language. This statement is then communicated with 
the program, which in turn will translate the statement to 
appropriate actions. 

Hundreds of programming languages are available today. 
These languages are categorized in levels, with respect to 
how closely they represent the instructions in the form 
they take when they are in the computer central processor, 
which is called in the literature "machine code". In the 
lower level, there are those languages which are similar to 
the machine code and which are machine dependent. In the 
next higher level are the so called high level languages, 
which are problem oriented rather than machine oriented as 
lower level languages are. The higher level languages were 
developed to solve the problem of program compatibility 
between the different types of machines. Programs written in 
high-level language can be in some degree read and under- 
stood by someone who has not learned the language. 
Commercial languages resemble highly artificial but still 
recognizable, English; scientific languages resemble mathe- 
matical notation. These high level languages still require 
that a compiler be written for each machine on which the 
program is to be run. The fact that it is very common that 
several features of a specific language will be modified for 
several reasons, makes it possible for several "dialects" of 


the same language to exist. The attempts are still directed 


lead 


towards the production of a language which is sufficiently 
general for all purposes, can be understood by almost 
anybody and is sufficiently simple for the production of 
compilers of identical versions on all types of machines. 
The problem of ideal communication has still not been 


resolved although enough progress has been made. 


A. CHARACTERISTICS OF PROGRAMMING LANGUAGES 


The software psychologist Gerald Weinberg [Ref. 1], 
determined a number of psychological characteristics which 


exist in the programming languages, in various degrees. 


Loa Unitorm ey 


Uniformity characterizes the degree to which a 
language uses consistent notation, applies arbitrary 
restrictions and supports syntactic or semantic exceptions 
to the rule. For example ADA's "overloading" permits the 
use of identifiers and operators which may have alternative 


meanings within the same scope. 


2. Ambiguity 


Ambiguity characterizes the degree to which a 
statement is interpreted in different ways by different 
programmers although the compiler will always interpret 
that statement in one way. For example the arithmetic state- 


ment 
Z =e Bi Cera dD 


may be interpreted by one user as Z = A+ (B / C) * D and by 
another as Z=A+B/ (C * D). 
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3. Compactness 


Compactness of a programming language characterizes 
the amount of coded information that must be recalled from 
the human memory. Some compactness measures are the variety 
ef data types, operators, built in functions etc. APL for 
example which permits little code to be used for arithmetic 
and logical operations, exibits:’ a high degree of 


compactness. 


B. HUMAN MEMORY AND LANGUAGE CHARACTERISTICS 


Human memory characteristics strongly affect the way we 


use a particular language. Human memory is divided into 
synesthetic and sequential domains [Ref. 2:pp. 270]. The 
me Sst , allows us to remember and recognize things as a 


whole; the second, helps to recall the next thing ina 
sequence. Both these characteristics have a strong effect to 
two other language characteristics the locality and 
linearity. Locality is related to the degree to which the 
statements can be combined to block, support the structured 
constructs and code modularity and cohesiveness. Linearity 
is accociated with the degree that a sequence of logical 
operations facilitates human perception. 

The ability of a human to learn a new language is deter- 
Mined by the characteristic of tradition. For example a 
person with an ALGOL background should not be expected to 
have any strong difficulties in learning PASCAL. The same 
person will probably have problems in learning PROLOG or 
Err . 


C. CHARACTERISTICS OF NATURAL LANGUAGES 


Almost all programming today, is done in high-level 


languages which to some degree are designed with the human 
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in mind. However, high-level languages always remain highly 
stylized with strict rules which have to be learned and 
followed. This formalized type of dialogue is in many cases 
undesirable. A computer programmer often feels the need to 
give instructions to the computer as if he had given the 
Same instructions in another person. 

Considering Natural Language characteristics and artifi- 
Cial languages characteristics both similarities and big 


differences can be found. 
1. Humanized Syntax 


The presence of human syntax is the most significant 
characteristic of Natural Language. There are numerous words 
and word combinations which are used by human beings and 


found in a Natural Language. 


2. Disordered Word Arrangement 





Another important characteristic 1s that words in 
Natural Language are used disorderly and they are very often 
used for different purposes. They may for example be at one 


time verbs and the other time, nouns. 


3. Use Nore inpertecteE teres s toms 


Imperfect expressions of thought may be used and are 


"legal" for the creation of sentences. 


4. Presuposition 


Another characteristic is that one of presuposition 
[Ref. 3:pp. 40). This characteristic is a result of the 
thought that the recipient of the instruction has mental 
processes which are close to those of the person who gener- 


ates the instruction. 
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oe venom ty and Base of Learning 


These are the two most attractive features of 
Natural Language because they provide the user with freedom 
in constructing his statements and minimize the mental load 


for learning them. 


D. AMBIGUITY IN NATURAL LANGUAGES 


Many words and sentences of Natural Language are ambig- 
uous. This means that they may have more than one meaning. 
Even if an entire statement is unambiguous its parts may be 
ambiguous with respect to the context. To make the computer 
recognize the meaning of Natural Language statements 
requires programs of very high complexity and this is rather 


impractical in the majority of today's serial machines. 
Ll. Double Negatives 


Another case of ambiguity occurs when double 
negatives are encountered in the statement. A characteristic 


example is the problem which arises with the rules of 


Boolean Algebra. In conventional programming languages the 
expression "NOT FALSE" can be replaced by "TRUE" and the 
expression "NOT TRUE" by "FALSE". This cannot be done in 


Natural Language. 
Peeeolmitarity of Wording 


In many cases similar expressions may have different 


meanings. For example the expressions 
ae book of Jm G. Norton 
and 


A book of Physics 
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have similar wording but different meaning; The first means 
"a book written by J. G. Norton" and the second means "a 


book whose subject is physics". 


3. Time of Statement Execution within the Program 


Ambiguity also occurs when no distinction is made to 
various instructions with respect to the time of execution. 
This does not happen in programming languages where there is 
clear distinction between these instructions which have to 
be executed in the main program and those to be executed in 
subroutines. In addition, while in programming languages 
conditional instructions and conditions are clearly stated 
this is not always done in natural language. For example, in 


the following series of activities 


study chapter #5 
do exercises in chapter #4 


go for a walk 


no clear distinction is made about the desired time of 


execution of each statement with respect to the others. 
aa Use ef TAND | ; HOR” ’ NOT 


The use of "AND", "OR" and "NOT" is another case 
where confusion may be created because sometimes the "OR" 
may be conceived as inclusive and other times as exclusive, 
depending on the context. This of course does not happen in 


programming languages where their use is precisely defined. 
5. Pronouns 


Pronouns are also a potential source of ambiguity. 
Their presence generates serious interpretation problems 
because determining what they refer to can be defined only 
with the help of the context. 
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oe Moteametical Expressions 


Arithmetic expressions are another source of ambi- 
guity. Interpretation of numbers and arithmetical operations 
in natural language is not always a simple task. For example 


when we say 
France produces 2000 Mirage airplanes 


we do not specify if "2000" defines the model number or the 


quantity produced. 
7. The "dangling" ELSE 


A common ambiguity problem to both programming and 
natural languages is the problem of "dangling else" where we 
cannot specify to which "if" the "else" refers to. The 
difference is that although various programming languages 
provide various solutions to this problem, natural languages 
do not. For example, if we consider the series of program 
instructions displayed on Figure 2.1, we cannot specify to 
which "IF", "ELSE" refers to. 


If x > QO then 
yee Sarin =a) 
If x < O then 


Y= 1 * sqrt(-x) 





Figure 2.1 Dangling "ELSE". 
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E. NATURAL AND NON=-PROCEDURAL PROGRAMMING LANGUAGES 


There are some cases of natural language dialogue where 
possible misinterpretation 1s unimportant. If the conditions 
require a precise natural language input, the user must be 
confined in what he wants to say to a small vocabulary 
whose words are precisely defined. This means that there is 
always a trade-off between dialogue flexibility and 
preciseness. Humans are able to establish a meaning even 
for incomplete statements by examining those statements in 
context of their own view of the world. It is very difficult 
to provide the computer with such a huge context where all 
possible statements can be referenced. 

In most of the practical applications today, the vocabu- 
lary used is limited. In these cases, natural language tends 


to be structured by employing formal structures which are 


precisely defined. After this, natural language is not as 
much effective oor even acceptable, aS a programming 
language. There is today a trend to transit from procedural 


languages like COBOL, PL1l to non-procedural languages like 
FOCUS, INFO etc. A common feature of non-procedural program- 
ming languages and programming languages is that no precise 
step-by-step instructions are required but in some way a 
specification of the desired result. The functionality of 
the non=-procedural languages is so high that most organizZa- 


tions tend to adopt them. 


EF. NATURAL LANGUAGE IN DATABASE QUERYING 


Natural language may also be used to retrieve informa- 
tion from computerized databases. The use of natural 
language in database query is directly related to the 
benefit of taking advantage of the database processing (more 
information out of a given amount of data, increased user 


satisfaction etc.). 
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Two solutions exist basically to the problem of database 
querying: 

* Adapt the user to the machine by having him learn a 

formal artificial querying language. 

* Adapt the machine to the user by making it able to 

understand a section of natural language. 

The first solution presents several disadvantages which 
cannot be ignored. The most significant is the cost involved 
in training users to learn formal querying languages. Beyond 
this problem there is another significant issue. The fact 
that a formal language will be used does not eliminate the 
dangers of ambiguity where various users will state their 
requirements in their own different ways. 

The value of anatural language query is directly 
related to how effectively natural language can be dealt 
with a particular application. Natural language contributes 
to the understandability of the database queries by: 

¢ Providing capability of selecting subsets of file 
records with respect to the desired criteria. 
* Reflecting the data manipulation process. 

* Reflecting user's logical view of data. 

Measuring user's satisfaction with a specific natural 
language querying system is today possible and several 
commercial systems have been tested so far. The results of 
these tests show that some elementary and short English 
forms have been found by the users to be more convenient 
than natural language queries. A recent experiment [Ref. 3] 
for the evaluation of the desirability of natural language 
queries as compared to a formal query language like SQL 
cemeluded that: 

* SQL is for the moment superior to natural language. 

e Natural language is briefer than SQL. 

* Natural language does not provide user with adequate 

feedback. 
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III. THE DESIGN OF EFFECTIVE MAN-MACHINE INTERFACES 


A. DESIGN STEPS 


For the design of the man-machine interface several 
ideas may be considered and there are several steps that the 
designer should go through [Ref. 4:pp. ie Before 
proceeding further, it is useful to look at the design steps 
of the man=machine dialogue to provide a context for the 
discussion of various user and dialogue aspects: 

* Organization of information flow: To select a configu- 
ration for information exchange between the user and 
the computer. To see for example, if the user will 
communicate with the machine directly or through an 
experienced operator. 

¢ Selection of a general dialogue category: To determine 
a category. To see for example, if a programming 
language is going to be used instead of a limited 
English input (see Table I). 

* Hardware requirements specification: To relate dialogue 
requirements to system configuration. 

* Dialogue structuring: To determine various formats and 
other details. 

¢ Establishment of error procedures: To establish proce- 
dures for minimizing the likelihood of errors and when 
errors occur to maximize the possibility of detecting 
them. To provide also means for minimizing the conse- 
quences of machine errors. 

* Dialogue simulation: To assure that the dialogue will 
work well with the specific users. 

¢ Dialogue immunization: To make programs invulnerable to 


lrrational user behavior. 
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TABLE I 


Dialogue Categories 


Artificial Programming Languages 
Advantages ‘be clse , MeonGcisie 


Disadvantages : Inappropriate for users with low 
programming experience. 


Natural Languages 


Advantages ;: Easy to learn and use. 
Disadvantages : Unsuitable where precision 1s 
ec Ud Gel . 


Limited Natural Language Input 


Advantages : at papal A with the vocabulary. 
Disadvantages : Overestimation Of —Jthe real 
understanding capabilities OF 


the system. 


Computer-initiated Dialogue 


Advantages Mell@ebemiset training, very low 

errome possibility. 
Disadvantages : Dialogue is Long and very slow. 
User 1s restricted to a finite 


number of predefined options. 


User-initiated Dialogue 
Advantages >: Simple for the user. 


Disadvantages : Low reliability, limited scope 
and flexibility 


Light-pen Input 


Advantages >: Simple, good for inexperienced 
users. 
Disadvantages : Limited scope. 


Graphics with Charts Displays 


Advantages >: Effective representation of the 
substance of the information. 
Disadvantages : Expensive. 
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B. THE NEEDS Ofte vou 


It is very important in the initial steps of the 
dialogue design, that user needs and characteristics will be 
identified. These will provide a frame on which to taylor 
the dialogue. It is possible that more than one dialogue 
type and special feature of each type will be found to be 
appropriate and that the designer should consider all 
possibilities before deciding upon a particular type and 
structure. 

By examining various user types which are frequently 
mentioned in the literature [Ref. 5], five types are usually 
refered to: the casual users, parametric users, analysts, 
application programmers and systems programmers. These types 
are classified according to (i) the ability to handle the 
system, and (11), by how often and how long they make use of 
it. It is important to notice here that user types should be 
seen as a set of information needs rather than as a set of 
persons. 

Another area we should look at is the user's behavior at 
the interface. In doing this we consider the factors which 
affect this behavior from two viewpoints: 

e The user as a human being who has certain needs. 
e The user facing the system. 

Beginning with the first, user has needs which must be 
Satisfied. Under this assumption the user probably wants to 
be in front of a user-friendly system which does not force 
him to act in a strictly predetermined way and does not 
restrict him in his communication. 

User's behavior in front of the system is affected by 
his data processing and application background, his attitude 


towards the system, his objectives and the task structure. 
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C. USER ORIENTED OBJECTIVES OF A MAN-MACHINE INTERFACE 


The primary concern is always to design an interface 
whose goal is to adapt the computer to the user and not to 
force the user to adapt to the computer. 

Considering the user as a human being the objectives 
shall be directed toward his or her general needs, skills 
and characteristics: 

¢ Systems behavior towards user must be flexible so that 
user will be provided with a certain degree of freedom 
in his actions. 

¢ The system must be adaptable to the various user types 
and interaction modes. 

° User must not be concerned too much with system's 
behavior. 

¢ The system must never cause frustration to the user. 

e The man=-computer interaction must be as similar as 
possible to the communication between humans. 

« The system must be able to interact with the average 
user who does not have special abilities. 

° System's behavior must not be contradictory, so the 
user is allowed to easily get accustomed with system's 
operation. 

¢ The dialogue must be performed in a way which enables 
user to gradually pass into a more sophisticated type 
of dialogue. 

e The system must be as insensitive as possible to errors 
made by the user and must not introduce errors by 
itself. 

Considering the user facing the system, the objectives 
Shall address the factors which affect his behavior: 

* System's use must be possible for users who do not have 


special education. 
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¢ The system must be able to interact with users having 
different levels of experience. 

* Different systems must have interfaces which are stan- 
dardized. 

¢ System must not cause disappointment. 

¢ System's handling must not turn user's attention from 
the actual problem that he uses the computer used for 

¢« System's use must not be restricted to specific cases 
but it must allow the user to modify the communication 


according to the requirements. 


D. TECHNICAL ISSUES OF MAN-MACHINE DIALOGUE DESIGN 


The term man=-computer dialogue means the interactive 
exchange of messages between the user and a dialogue system, 
according to an agreed dialogue language anda dialogue 
category, to accomplish a desired task. A basic principle 
of the man=-computer dialogue is that the user has _ no 
restrictions on the selection of input and system's reaction 
is conclusive. The means for the exchange of messages 
between the man and the computer are the dialogue modes. The 
five best known dialogue modes are 

° Commands. 

¢ Menu. 

e Form eta Liang. 

e Yes/No questions. 

* Natural language interaction. 

The properties of a basic dialogue model have been char- 
acterized [Ref. 5:pp. 47], as follows: 

¢ In order for a dialogue model to be applicable, this 

model must be considered separately from the following 
details of 
=- Distribution “of roles of dialog initiator and 


dialogue responder. 
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- Special subject matters and objectives of dialogues. 
- Special input/output techniques and devices. 

°e For the different addresses of the dialogue 
descriptions the modes must be 
- Representable in a simple and expressive way (user). 
- Accessible to formal treatment (theorist). 
- Appropriate IBre ne transformation Wbghiete: programs 

(implementor). 

Petri-nets which were developed for representation of 
non=-sequential processes, meet the requirements of the 
previous model satisfactory. They are considered very good 
for the representation of the different faces of the 
dialogue. Refinement and generalization of nets allow repre- 
sentation of certain dialogue aspects in the desired level 
of detail. 

The more general level of the dialogue concerns’ the 
representation of the cooperation between information 
processing units and information communication units. 
Information processing units are the initiator and the 
responder/recipient of information and information communi- 
cation units are the channels through which the information 
is transmitted. The next lower level is concerned with 
dialogue processes which are described in general. The last 
level, describes processes in detail and illustrates rela- 
tionships between actions of dialogue participants and the 
messages which they communicate. 

The basic unit of dialogue between two participants is 
the dialogue step which consists of processes and messages 
of the one participant followed by the processing and 
messages of the other. Petri-nets allow us to represent 
sequences of dialogue steps. The different dialogue types 
can be characterized from the role of action and reaction in 
the dialogue step. Combination of distinguishing features 


merow the outlining of the basic structure of all the 
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possible dialogue steps. Description of the messages by 
which action and reaction are represented helps to distin- 
guish among various dialogue types. A sequence of dialogue 
types specifies a particular dialogue method. 

The human uses the computer for a certain purpose. 
He/she formulates tasks for the computer and the computer in 
turn corresponds to the assigned tasks which reflect the 
subgoals which the user sets for resolving a particular 
problem. The computer executes an indicated task by a set of 
activities which user cannot see. The user only knows the 
set of functions of the system which are realized by means 
of those activities. The system's functions are the 
abstraction of tasks and activities. When the system 
chooses a task the user is directed from the available func-= 
tions. When he enters his task the computer identifies the 
required function and executes the corresponding activities. 

There are six categories of functions [Ref. 5:pp. 60]. 

¢ Dialogue organization functions : help to organize the 
session at the terminal. 

* Control functions : enable the user to control ame 


processing sequence of his tasks. 


* Input/foutput functions : apply to the user's terminal. 
¢ Help functions : provide help in case of difficulties. 
¢ Communication functions : enable the user to interact 


with other users of the system through the terminal. 
¢e Extension functions : help the user to perform similar 
complex tasks which are repeated very often. 

To support user's orientation during the dialogue, the 
system must be able to remember the past dialogue and know 
about the available functions and the actual dialogue type. 
In other words the system must know the dialogue state and 
the data which the dialogue handler has stored. The user 
actually is concerned only with the available functions. 


The system's design must allow the user to choose the 
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dialogue mode and to get information about the dialogue 


past. 


E. ERRORS IN MAN-MACHINE DIALOGUE 


Error occurrence is a common case in man=-machine commu- 
mication. This happens even with the most experienced 
users. Errors are not usually considered as something 
special but they very often have very serious consequenses 
to both the system andthe user. A great part of these 
errors come from the lack of sensitivity on the part of the 
museecn Gesigner to the needs and abilities of the user. 

Errors often belong in two broad categories. The first 
includes the errors concerning the erroneous implementation 
of a correct user intention. The other includes the errors 
concerning the implementation of an incorrect user inten- 
fron. If we examine those errors with respect to their 
sources we will identify several classes which exist. 

A large class of errors concerns the cases where the user 
believes that the system is in one state while it is actu- 
ally in another. An example is the case where users give 
executive mode commands while they are in the edit mode. 
Another class of errors includes those errors where the user 
does the right operation in the wrong thing. They occur when 
ambiguity exists about which is the proper action and are 
caused when the action is not adequately specified. An 
example here, is the errors which occur in some text editors 
when some lower-case letters may cause the execution of 
different commands from those caused by the same upper-case 
letters. Some other errors are these which are caused when 
the user causes anaction different from the intended 
because of insufficient knowledge about the proper operation 
he tries to derive the operation by analogy with something 


Similar. The last class of errors are these where the 


Zo 


desired action cannot be carried out because of some hard-= 


ware failure. 


EF. GENERAL RULES FOR INTERFACE DESIGN 


This section presents a set of general rules based on 


what we have seen so far. Of course these rules are not 
absolute and in some cases they may overlap each other. The 
rules address two major goals of the interface design ; to 


Minimize the mental load of the users and to provide the 
capability of early error detection, minimization of error 
consequences for both the system and the user and cGasy 
recovery: 

¢* MininimiZing the mental load of the user: 

- All terminology and procedures should be uniformly 
available and consistently applied. 

- All information that the user needs should be easily 
and quickly accessed. | 

- The information should reflect the actual use ofMaam 
system. 

- Users should have some moderate "hands on" experience 
from some type of a model system before using their 
own. 

- Users should be given feedback about the state of the 
dialogue as often as possible. 

e¢ Minimizing the error correction and recovery effort: 

- System should provide means for validating data upon 
entry. 

- User should be allowed for backtrack through the 
dialogue sequence. 

- Different categories of actions should have have 
dissimilar command sequences. 

- The actions which may have serious consequences 
should be difriveult to ldo. 
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- System should be tolerant of both its own and user 
errors. 

In aa attempt to derive some possible general rules for 
the design of an effective interface, we examined the three 
Major aspects for this task ; the user, the dialogue mecha- 
nism and the errors in the interaction between man and 
machine. Some general rules were at the end presented, based 
on the analysis of those three aspects. 

These rules are by no means unique or absolute. 
Man-machine communication is a very sensitive and complex 
research area which consists of as the subject of interest 
not only the computer, but also and other sciences as cogni- 
tive psychology, ergonomics, etc. The fact that a human is 
involved as one component of the dialogue and the machine as 
the other makes it very difficult to create rules which are 


absolutely tailored to every type of communication. 
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IV. CONCEPTUAL RECOMMENDATIONS FOR MAN-MACHINE INTERFACES 

A goal in the design of effective man=-machine interfaces 
of the modern computer systems is that they will address the 
average user. This implies that they will be designed for 
users who are neither novices nor experts at computer 
handling. The goal of the design is to present an interface 
which takes into account the characteristics, skills and 
information needs of this type of user and his or her 
possible behavior in front of the machine. 

Prior to examining the role of natural languages in the 
design of "humanized" interfaces we should look at the 
understanding, or in other words, process of communicating 
ideas between man and computer. When we say man-machine 
communication we mean the communication of ideas between a 
human and a program, which is in the computer in order to 
accomplish a specific purpose. In this process the ideas in 
the human mind are generators of statements which in turn 
communicate to a program which translates them into appro- 
priate actions. In this context, a statement is said to be 
understood with respect to a given language and a target set 
of actions. In order for a statement to be understandable it 
must be associated with a single set of actions. 

This is the point where the problems with the use of the 
natural language start. These problems come from the fact 
that more than one set of actions may be related to each 
statement. This implies more than one meaning for the same 
statement and this is the source of ambiguity. This requires 
that there be one-to-one relationships between programs and 
target sets of actions and in many cases these sets may be 
so broad that it is impossible to consider the entire user's 


statement. 
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In order to decide if any language provides an appro- 
priate interface, it is necessary that the degree to which 
this language addresses the characteristics and needs of the 
user be taken into account. The role of natural languages in 
the effectiveness and efficiency of man-machine interfaces 
is examined here with recpect to various user and other 
general attributes 

Two principles for the Man-Machine Interface have been 
identified; first, the interface must not force the user to 
adapt to the computer and second, the interaction must be 
designed having an mind the average user who has not any 
special capabilities in handling the computer. 

There are though, many other questions which the 
designer of a man=-machine interface should answer; What are 
the capabilities of the user? What is his/her data 
processing background? How is he/she expected to behave in 
front of the system? How often will he/she use the terminal? 
How intelligent is he/she? How much must he/she be trained? 

All these characteristics will help the designer to 
address the major issues in the interface design, like the 
following: 

¢ Functions which must be provided at the interface. 
* Dialogue categories that user can use. 


e Levels of dialogue that user is going to use. 


A. REALIZATION OF MAN-MACHINE INTERFACE OBJECTIVES 


several objectives have been defined for the man- 
Machine interface. Bor further discussion on how the 
objectives will be realized it was found appropriate that 


the most representative among them had to be discussed: 
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Ease of Learning 


User must be able to learn the dialogue by himself 


without special assistance. This objective addresses the 
novice user. Learning the interface asks for the satisfac- 
tion of the following requirements: 


e 


The first thing that user needs to know is the func- 
tions of the interface. 

The user must be first introduced to the basic func- 
tions. Then according to his/her desire and experience 
he/she may pass ina more sophisticated level of 
functions. 

A good practice is that the user is not "flooded" with 
unnecessary information and he/she must be left the 
initiative to request additional information when he 
considers it necessary. The creation of passive users 
should be avoided. 

The output must be presented in an understandable and 
meaningful form where all subjects are easily 
recognizable. 

The number of rules which govern the dialogue and which 
the user must memorize should be as small as possible. 


The most attractive characteristic of natural 


language is that it makes learning of interface very easy. 


ment 


Fasesom Use 


The system must not create confusion or disappoint- 
in the user and it must be easy to handle: 
The first thing that every user wants to know is infor- 
Mation about the state of dialogue. 
A large number of help functions must be provided. 
The request for information by the user and the 
system's response must resemble the natural language as 


much as possible. 


34 


¢ There must be a fairly large number of functions which 


allow the user to dominate the system. 


See Lexibility 


The system is adaptable to the user and task 
characteristics: 
° The system provides a large number of functions. 
¢ The system provides a variety of dialogue types. 
* The user is able to switch from one dialogue type to 
another and create the desired level of sophistication. 

An important notice at this point is the trade-off 
between flexibility and complexity. 

The lack of strict rules in the use of the natural 
language makes the user feel comfortable in front of the 
machine. This benefit though, very often becomes an illu- 
sion if the language makes use of "legal" and "illegal" 
statements. This means that the user is indirectly forced to 
learn the boundaries of the language use. For example, a 
natural language database querying system may consider as 


legal the statement 


Give the names of all CO's of USS Enterprise 
Since 1980 


whereas the same system may consider the statement 


Give the names of all CO's of USS Enterprise 
the last 5 years 


as illegal. The query in this example is actualy satisfied 
because there is at least one statement which is acceptable. 
The problem starts at the point where the user must learn 
which are the illegal statements. So natural language is 
desirable if the subset which has been chosen is’ broad 
enough and the users are not obliged to learn which state- 


ments are legal and which are not. 
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4. mans passener, 


System's behavior is predictable; the user can 


predict what information the system 1s going to provide him. 


So Reliabiiiey 


The system is designed ina way which allows the 
user to conduct a dialogue in reliable way and this asks for 
the satisfaction of the following requirements: 

¢ System's behavior must allow the user to know every 
possible effect of his/her input to the system. 

* System's reaction to possible errors must not affect 
the continuation of the dialogue. 

Although natural language is not governed by such 
strict rules as the artificial languages it in reality may 
push the user to make several kinds of errors. One charac-= 
teristic category of such errors is that which includes the 
errors caused because of the similarity in the expression of 
various commands in natural language and this will make the 
user get confused about which is the appropriate command to 


use 


6. Precision Requirements 


The problem of ambiguity is an inherent attribute of 
natural language statements. The reason is that these state- 
ments seldom stand for a single meaning. In cases where high 
precision is required ambiguity makes natural language 
interfaces inappropriate. The problem of ambiguity may 
somehow be resolved by combining natural language with some 


artificial language. 
7. ~Beedbachs tomenomuscd 


The lack of precision which causes so many problems 


in the understanding of the statement is the source of 
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another problem; it is not easy for the system to provide 
feedback to the user about errors in dialogue or the exact 


reason why the system cannot understand the statement. 


8. Picture Description 


A very important form of man-machine dialogue is the 
communication through pictures. The media for this type of 
communication differ in their effectiveness and in the type 
of information they carry. An obvious weakness of natural 
language is the inability to describe shapes, positions and 
graphs. So the natural language cannot substitute for media 
like charts, tabulations and other screen representations. 
Programs which manipulate graphical objects may have natural 
language interfaces only if they are integrated with appro- 


priate graphics based communication tools. 
9. Interface Standardization 


In implementing the natural language interface it is 


necessary that each program must be associated with a single 


target set of actions. This implies that a great number of 
these sets must exist. This does not favor the 
standardization of the interfaces and “Mmanimizatwvon ‘of 


training requirements for the users. 


£0. User Experience 


The use of natural language implies that the opera- 
tion of the interface is independent of user's experience in 
a specific artificial language. The same does not happen 
when an artificial language has to be used. In this case 
user's experience with the specific language has a signifi- 
cant impact ali the effectiveness On man-machine 


communication. 
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ll. User Behavior 


The more the language used as interface resembles 
the natural, the more comfort the user feels facing the 
system. There is though, a trade-off between user's conven- 
lence and the other aspects like ambiguity and complexity 
and consequently the cost of implementation. This of course 
does not exclude the case where an experienced user may use 
a "telegram" type dialect and does not feel very good when 
he must use natural language as it is used for conversa-= 


tional purposes. 
12. Cost of the Interface 


The design and implementation of natural language 
interfaces is in general more costly than any other inter- 
face. This cost is related to the degree of resemblance of 
the language used as interface to the natural language. As 
language becomes more restricted and formal, the cost of the 
interface decreases. The reason for the high cost of the 
natural language interface is related to the difficulty in 
making the machine understand the meaning of the human 
statement. In other words the ambiguity is the most signifi- 
cant factor in the cost of natural language interfaces. 
After all this, the use of such interfaces must be consid-= 
ered with care and there must very serious reasons to ignore 


the high cost associated with such a decision. 
TS SeMameher Complexicy, 


From what we saw so far we may initially conclude 


that the implementation of natural language interfaces is 


very troublesome and is not worth consideration. This is 
not ageneral rule. There are cases where the use of 
natural language is suggested. All these cases concern the 


Situation where the number of possible different messages 
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which user wants to communicate is so big that the artifi- 
cial languages have not the power to accomplish this task. 
Under these conditions a language interface which is 
adequately complex to handle the variety of messages is 
recommended. 

There is a close relationship between the complexity 
of a program and the range and complexity of the interface 
to the program. This connection may often lead to the 
pitfall of "“oversophisticated" interfaces as they are 
conceived by the user, and in this case the user may believe 
that the interface has unlimited capabilities and can answer 
any question. 

English of course, ie nee the only “natural” 
language which may be used in man-machine interfaces. 
French, German or other languages may be candidates for the 
same purpose, but they should be expected to cause the same 
trouble . The reason is that certain grammatical character- 
istics like inflection of word endings, declensions and 
conjugation will only be recognized in context. Perhaps the 
invented languages like Esperanto would be better for the 
same purpose. 

The main difficulty with any natural language is 
that it probably is not as logical as programming languages 
and this same attribute implies that artificial languages 


cannot be used for conversation. 


PeeeOUNCTIONS PROVIDED AT THE INTERFACE 


It is desirable that the number of functions provided 
will be as large as possible, so that the user will not be 
restricted during the interaction. The number of these func- 
tions characterizes the flexibility of the interface. Each 
of these functions plays its own important role for’ the 


effectiveness of communication between the man and the 
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computer. In [Ref. S:pp. 42], six categories of functions 


are distinguished: 


i input /Oue DUI sue Eons 


They are absolutely necessary because they provide 
the actual means of man-machine interaction and for this 
reason they must be well designed and easy to understand. 
In the realization of the various dialogue categories two 
major aspects have to be considered. These are related in 


[Ref. 5:pp. 109] to the system's input and output. 
a. Input considerations 


For the input the designer must consider’ the 
following: 

(1) The vocabulary. Having in mind ~jau 
possible levels of user's experience, a recommended vocabu- 
lary for the novice user would resemble that utilized in the 
natural language. It will become thereafter, more and more 
restricted and stylized as user's experience with the system 
grows, ending ina dictionary for a highly professional 
dialect. 

(CZ ) ee yilediGh leans mUcion The number must beware 
small as possible, so that they will easily be learned and 
memorized. They must also emphasize the accuracy and 
completeness of the entry. 

(3) Format rules. They are in general undesi- 
rable. The user has already the mental load of memorizing 
syntactic rules and there is no reason why this load should 
be increased by having him memorize additional rules. 

(4) Feedback. The system, must clarify any 
problems caused by incomplete or erroneous input and guide 


the user in locating the point of the problem. 
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b. Output considerations 


The design principle for the output is that the system will 
provide the user with the relevant of information and in 


desirable .form. 


(1) Ibe vocabulary. It should include natural 
or professional words. Such a dictionary is desirable to 
users of any level of experience. ‘This 1s a necessary 


requirement to ensure that the output will be meaningful and 
understandable. Such a requirement cannot be satisfied by a 
vocabulary which is mostly composed by symbols, abbrevia- 
tions and formal expressions. 

(2) Output syntax and format. It must reflect 
the requirements of the actual output andthe important 
parts must be easily recognized. 

(3) Variety of representation. It must be 
possible that the output will be presented in a variety of 
forms so that the user will be able to select the form which 
fits to his/her needs best. 

In this context, it is desirable that we 
shall identify another aspect. This is the need that the 
output will reflect facts and objects from input. This will 
enable the user to figure out how the results are related to 


the input data and how the information in hand has'7 been 


produced. 
wee Control Purctions 
They provide a mechanism for controlling the 
dialogue. They are separated in functions which are of 


interest to any user and the functions which address 
experienced users and and special tasks. Necessary control 
functions are those used to start , interrupt and terminate 


the dialogue. 
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3. Helo Bumeeieric 


They provide the user with special information about 
systems characteristics or the dialogue itself. An example 
of an interface which provides the user with adequate 
support, is that one in the Defense Data Network (DDN) 
[Ref. 6]. When the user wants to see which functions and 


commands are available he has just only to type: 
HELP ? 


and the system will provide him with a list of functions, 
commands and other features which are available. After this 
the user can request information about a command for example 
"LOGON" by typing: 


HELP LOGON 


and the system will provide him with information about how 


tf 


to "log on" the system. 


4. Reorganization Functions 


They enable switching from one dialogue category to 
another. This is a necessary feature for the interface. 
Although the primary concern of the design is the average 
user, different levels of experience will exist within the 
user community and, even more, a specific user's experience 
will continuously increase and Heya will probably want to 


be able to pass into more sophisticated levels of dialogue. 
5. Communwea tion Funetions 


The integration of communication functions in the 
system depends on the requirements of each particular appli- 
cation. Sometimes they are necessary, sometimes not. It is 
believed that they should be included in the interfaces of 


any contemporary information system. This comes from the 
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fact that the requirements of the information systems in the 
80's decade ask for integration of data processing and 


communications. 
&.. Extension Functions 


They may be seen aS a special category of control 
functions . They are suitable for use by experienced users 


in sophisticated tasks which are performed routinely. 


C. DIALOGUE CATEGORIES 


Io organize the dialogue categories we should first 
distinguish several user categories in relation to their 
data processing experience. These categories are the novice, 
intermediate and experienced user. It is possible that all 
these categories will coexist within the same community. 
This is a result of the personnel turnaround which more or 
less is a problem for any organization. It is expected that 
an effective man-machine interface will cover the needs of 


users of any experience. 
1. Menu-driven Dialogue 


The system proposes task options to the user. The 
user can choose the option which he/she feels that fits 
better with his/her needs. This type of dialogue is appro- 


priate for novice users. 


Zz oystem-initiated Requests 


In this type, the system asks the user to make an 
input simultaneously providing the user with the necessary 
syntax for the input. If the input is not complete or 
correct with respect to the specification of the task, the 
system may respond in one of the following ways: 


e Request for the user to make another input. 
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¢e Provide the user with the available alternatives to 
continue. 

* Identify the incompletely or erroneously specified 
subjects. 

This type of dialogue is appropriate for users of a 


certain level of experience with the specific system. 
3. User-sinitiated Commands 


Highly experienced users may prefer to specify their 
tasks in a highly stylized and formal fashion through the 
use of commands. System intervention is restricted in the 


specification of incomplete or erroneous information. 
4. Natural Language Dialogue 


This type of dialogue may be user-initiated or 
system-initiated. The vocabulary used resembles that which 
human beings use to communicate their ideas. The natural 
language dialogue imposes many implementation problems which 
increase when the user is the initiator of the dialogue and 
the degree of approximation to the natural language. 
Inexperienced users will probably desire such a dialogue. 
The difficulty of implementation makes it inappropriate for 
input, but suitable for guiding the user in the dialogue for 
the obvious reason that it 1S meaningful and easy to 


understand. 
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V. NATURAL LANGUAGE IMPLEMENTATION 


A. THE UNDERSTANDING PROCESS 


People want to communicate with the computer because 
they want the computer to do a task for them. In this task 
the computer may be "active", providing its services at the 
time of communication, or "passive" when it simply stores 
something for later use. 

At the beginning, an idea is generated in the user's 
mind about the action he/she wants the computer to do. The 
understanding process has to be translated into a statement 


in a source language that the computer is able to under- 


Stand. The statement expressed in source language is then 
entered into the computer. The computer must then "under- 
stand" the statement and do the appropriate actions. By 


"understanding" we mean that a certain program has to find 
which actions are being asked through the statement; this 
implies that the statement is meaningless for the computer 
if the computer cannot associate the statement with a set of 
actions. 

This point is the beginning of all problems associated 
with natural language implementation and the reason is 
Simple; itis very difficult to provide the computer with 
the capability to find which actions are related with a 
statement expressed in natural language. In other words it 
is difficult to make the computer understand the meaning of 
statements expressed in natural language. This liability of 
natural language statements to more than one interpretation 
is the so called ambiguity. It is the main reason which 
generates the requirement that in order for the natural 
language to be efficiently used as man-machine interface has 


to be "problem specific". 
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Although man-machine communication is a two-way process 
we will consider here only the direction from the user to 
the computer because we are interested in the design princi- 
ples of a system in which the computer will understand 
user's statements and this is believed to be the primary 


issue of natural language implementation. 


B. COMPONENTS OF A NATURAL LANGUAGE UNDERSTANDING SYSTEM 


In studying the components of a natural language 
understanding system we have to distinguish two cases; 
understanding of single statements and understanding of 


text or dialogues. 
1. Sentence Understanding 


Sentence understanding is accomplished in three 
steps: the lexical analysis, syntactic analysis or parsing 


and semantic analysis [Ref. 7:pp. 41]. 
a. Lexical analysis 


In this step, the statement is divided into its 
component words. Dividing a sentence this way is not diffi- 
cult and it can be performed by scanning each sentence as it 
is in the input. The time required for this "scanning" is 
proportional to the length of the sentence. In this context, 
word is defined as a string of characters between two 
blanks. Of course when spoken language is used there 1s no 
way to see any words or spaces; in this case the lexical 
analysis requires information about the conditions under 
which the event occurs. 

Under certain circumstances it is required that 
a word will be further subdivided into a "root" and a prefix 
or suffix. This kind of segmentation together with the rules 
about how the pieces have to be combined to form a word 1s 


called morphological analysis. 
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The list of words that can be recognized by a 
specific understanding program is contained in its 
dictionary or lexicon. Every understanding program has its 
own lexicon. The diversity of ways for word representation 
in the dictionaries from one program to another makes the 
use of the existing commercial dictionaries inefficient. 

The selection of a lexicon requires two stages; 
earst, the concepts which have to be expressed must be 
isolated, and second , the appropriate words which represent 
the concepts have to be selected. The isolation of the 
concepts can be done by looking at the actions that’ the 
program can do. In selecting the lexicon another require- 
ment is that a distinction between a category of words in 
which addition of new entries is possible and a category of 
words where addition of new words seldom occurs, has to be 
made. Users are allowed to modify the arguments of certain 
actions , but not the actions by themselves. For example, in 
the statement 


CALCULATE NET INCOMES FOR 1982 


the word "1982" is a value for the "argument" year. User is 
allowed to change this value according to his/her desire. 
The ambiguity in a natural language interface is 
related to how big is the subset of the natural language 
used at the interface. The use of the entire set of natural 
language creates such an ambiguity that implementation of 


the interface becomes infeasible. 
b. Syntactic analysis 


Natural language statements are prior to any 
analysis, simple strings of words, but they correspond to 
structured ideas. The purpose of the first step in the 
understanding process is to find a structure which corre- 


sponds to the structure of the meaning of the statement. The 
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process of assigning a structure to an unstructured subject 


1s called syntactic analysis or parsing. The inputs for 
which parsing is possible are considered to be "legal" 
whereas the rest are considered to be "illegal". We want an 


effective man=machine interface to provide uS with as muem@ 
flexi bidiagy dn pavsiicmecomposs lt ble. 

The parsing process makes use of a set of rules 
which govern the means by which high level structures can be 
generated from lower level ones. The collection of these 
rules is called grammar. The higher structure which corre- 
sponds to a specific complete grammatical statement is the 
sentence. The way in which statements are created by their 
components can be represented with a tree diagram which is 
called a parse tree. An example of a parse tree which demon- 
strates the formation of the statement from its components 
1s shown in Figure 5.1. 

There are two approaches to the parsing process: 
the top-down and bottom-up. In the top-down approach the 
parsing starts with the sentence and continues by hypoth- 
esizing the lower components. This hypothesization continues 
until actual words are needed. The words are then compared 
to the actual input. In the bottom-up approach parsing 
starts from the bottom with statement words and continues 
upwards. When the components of each intermediate constit- 
uent become available, the constituent is formed and the 
process continues until the top (sentence) has been formed. 

Augmented transition network (ATN) [Ref. 8] is a 
very common approach for encoding the grammar. The network 
corresponds’ to the allowable transitions between the 
constituents of the statement. The arcs in the network guide 
the parser to take the appropriate actions during the 
Parsing process in order to end up with the desired struc- 


ture. An example of an ATN is provided in Figure 5.2. 
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SENTENCE 


NOUN! PHRASE 


VERB PHRASE 


VERB 
want 


iinaieial PHRASE 


Rese 
to see 


~ NOUN PHRASE 


NOUN 
article 


PREPOSITIONAL 


OUANTIFIER 
an. 


spenoalaeen 
about 


Eacuisemiorel Parse Tree. 





PHRASE 


NOUN 
ADA. 


START ==> noun phrase ==> prepositional phrase <--> END 





Figure 5.2 


Augmented Transition Network. 


In addition to the 
plays an important role in par 
sequences of the same words 


For example, the sentence 


lexical analysis, ambiguity 


sing. In this case multi-word 


are derived from the grammar. 
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Show all the articles about software requirement 


analysis and design or maintenance 
is ambiguous. It may be a request for articles about 


software (requirement analysis and design) 


or maintenance. 
It may be a request for articles about 


software requirement analysis and (design 


or maintenance). 


as well. 

Ambiguity in parsing, implies that more than one 
analysis is required for the same sentence. Sager [Ref. 9], 
suggests the addition of restrictions concerning the 
acceptable word combinations, to help the final syntactic 
analysis. Another possible solution is that the ambiguities 
will be resolved in later stages or else the user has to be 
asked about what is intended. It is possible that the same 
language will have many different grammars. These grammars 
May assign different structures to the same statement This 
implies that their degree of usefulness for a particular 


understanding program varies. 
c. Semantic analysis 


It is the most important step in understanding 
process. In this step the statement is assigned a meaning. 
In other words the actions which the program has’ to take, 
are determined. The semantic analysis is accomplished in two 
stages; the semantic and pragmatic processing. In the 
semantic processing the meaning of a sentence is determined. 
In pragmatic processing the action which the target program 


has to take, is established. For example, in the statement 


List all the customers in L. Angeles 
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we see that the meaning is related to the customers of L. 
Angeles. The action that the program has to do is to list 


those customers. 
2. Text and Dialogue Understanding 


In many cases understanding individual statements by 
a program is not enough. It is required that the system 
understands text. Several new problems emerge in this case. 

The most important problem in dialogue understanding 
is that in the context of the dialogue, individual state- 


ments are very often incomplete. For example 


User > Whewewes the length of USS Corry? 
Computer: LENGTH 396 FT 


User >: What is her draft? 
Computer: DRAFT 19ET 


User : Her maximum speed? 
Computer: SPEED 33 KNOTS 


In this example the statements #2 and #3 are incomplete. 
The statement in step #2 makes use of the anaphoric pronoun 
"her" in order to refer to a subject which has already been 
mentioned in the first statement. In order for the system to 
understand the second question the anaphoric pronoun "her" 
meee be tied to "USS Corry". ingmorder for the third 
statement to be understood, the words which describe the 
information which the user looks for must be placed in the 
previous statement into the right slots. In this example 
this means that a query "what is her speed" will be 
constructed from the statement "her speed" and it will 
finally take the form "what is the maximum speed of USS 
Sentry? 

The number of tasks which require natural language 


input which is greater than a sentence increases continu- 
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ously. Many of these tasks require programs that can deal 
with informal and incoherent texts which are very difficult 
to understand particularly cases where idioms and metaphors 
are present. 

There are several categories of programs and systems 
which deal with understanding of text and/or dialogues in 


natural language [Ref. 11]. 
a. Expectation-based parsing 


The interpretation of a word meaning often leads 
to problems. A characteristic case is that one where the 
meaning of the word is associated with the concepts the word 


refers to. For example, in the statements 


George got a cold 
George got a book 


the word "got" refers to two different conceptual structures 
and finding the exact meaning of "got" is very difficult. 

The meaning of a word in such cases can be 
captured by use of rules which test the context of the 
sentence and decide about the meaning. In our case, when 
the word "got" is seen in the input, the test rules are 
activated and examine the rest of the sentence, expecting 
that probably the test may become true. In the opposite case 
an action is taken which assigns a structure to the state- 
merc. These rules can be prepared so as to cover any 
possible meaning of the word "got" or any other word which 
creates the same problem. 

Conceptual analyzers are tools which assist in 
the disambiguation of such words. The process of disambigua- 
tion consists of predicting likely meanings and picking up 
from the word's definition the sense that corresponds better 


to the predictions. 


Sys 


b. Script-driven parsing 


In this case, conclusions about a story are 
generated on the basis of an internal description of the 
story, a kind of "script", consisting of basic elements that 
normally occur in events of the same type as that one being 
reported. For example a script for a battle will have slots 
for the time, place, damages and victims of the battle. Any 
story about some battle will then be treated as an instance 
of the script. It can be noticed here that what changes is 


only the value of the slots. For example in the sentences 
The battle of El Alamein in 1943 had 9645 deaths 
and 
The battle of Rhein river in 1942 had 11233 deaths 


are instances of the same script. What changes is the value 
of place, time and victims slots. 

A tool which operates on this principle is the 
FRUMP conceptual analyzer [Ref. 10:pp. 42]. EFRUMP scans the 
first sentence of the text in input, looking for words or 
phrases whose meanings are referred in one of the scripts in 
his dictionary. When a script is picked up, attempts to 
fill up the script slots are made. These attempts are based 
on rules which determine where to look in the input text for 
words or phrases whose meanings might fill up the slot. 
FRUMP automatically ignores things that cannot fit into the 


Soni pt. 
c. Interest-<driven parsing 


In this approach the principle of script-driven 
parsing is maintained and is combined with the use of 
frames. Frames are smaller than scripts but more modular and 


are integrated with each other. The idea in the interest 
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driven parsing is that the more interesting frames when they 
refered to the sentence become the driving mechanism for 
understanding the sentence. For example, in a sentence 


begining like 
"A long duration earthquake........ 


the word "earthquake" will start several actions. First, the 
meaning of the word "earthquake" will be examined. A defini- 
tion of earthquake wants the word as the primary participant 
in a frame for natural disasters. This frame describes not 
only what an earthquake is, but also its cause and its 
results. The analysis of the sentence is then governed 
further by the frame for natural disasters which looks for 
pieces to fill the existing slots in the frame. Preceeding 
and following inputs which were not so interesting as 
"earthquake" are now candidates to be selected for filling 
up the slots. For example "tectonic" is picked up to fill 
the slot for the type of earthquake which participates in 
the event of natural disaster. In addition to thier 
predictions about the rest of the sentence may be done. The 


word "leveled" in the sentence 


A long duration tectonic earthquake leveled the 


city of San Fransisco 


is interpreted to mean "destroyed" because that is a reason- 
able prediction of what a long duration tectonic earthquake 
can cause. As the analysis progresses other frames may be 
picked up, because one piece of input may be referred in an 
interesting frame which is not currently active. The anal- 
ysis is then governed by these frames. 

A tool used for parsing based on interest-driven 
analysis, is the Integrated Partial Parsing (IPP) system 
[Ref. 10:pp. 44]. A particular characteristic of the IPP 


system is that it ignores some words like "be", "have", 
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"give", "get" and so on, because they are referred to low 
interest frames. This characteristic becomes a problem when 
low interest frames are referred by words of the text but 


the text is of great interest to us. 
d. Goal-<driven parsing 


The analysis of the text is performed in 
bottom-up fashion, using expectations. These expectations 
are found by lexical analysis of the words in the sentence. 
The system considers only the most recent expectations. This 
has two consequences. First, the number of the things that 
must be cheked is small, and second, failure of one expecta- 
tion can block other expectations. 

A system which is based on this principle is 
McMAP [Ref. 10:pp. 46]. It can handle several different 
texts with multiple sentences, by keeping track of all 
structures constructed so far and by using inference rules 
to tie all these structures together. The main advantage of 
McMAP is that it can check the reasonality of a given 


interpretation. 
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VI. THE DESIGN OF NATURAL LANGUAGE INTERFACES 

To design an acceptable natural language interface, we 
have to consider several crucial aspects: 

¢ The interface must be application specific. 

e The subset of the natural language which is going to be 
used, must be sufficient to support the particular 
aApDpLIcCAtion- 

e The interface must be to some degree transportable. 

The size of the natural language subset is a very impor- 
tant factor in the definition of the design approach. We 
distinguish three cases: 

¢ The number of statements which are needed to be entered 
for the accomplishment of a specific task is small may 
be implemented through a finite number of options 
presented to the user on a screen. 

¢ The number of the statements which have to be entered 
at once is large enough and they cannot be efficiently 
combined, because of the large number of possible 
combinations, to create a finite number of options to 
the user, but the subset used is small. 

¢ The number of statements which have to be entered at 
once is so large (text or complex sentences which have 
pronouns or are referenced between them) that in order 
for the system to be able to understand the meaning of 
the input, it must capture as many characteristics of 
the language, as possible. 

Another factor which has to be considered, is the appli- 
Cacion environment. We Cistingurvsh tour application 
environments: 

¢ Data retrieval 


¢ Natural language dialogues and text processing. 
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* Text knowledge systems. 


* Natural language programming. 


A. DESIGN APPROACHES 
1. Menu-driven Natural Language Dialogue 


This approach is indicated when the total number of 
related statements which will probably be entered is small. 
This allows the presentation of various options to the user, 
who in turn selects between them and constructs the state- 
ment as a combination of these options. 

In a possible implementation of this approach, the 


options are presented through a screen, as in Figure 6.1. 


COMMANDS: List Show List all Show all Show the 


FEATURES CONNECTORS UALIFIERS QUANTIEFIERS 
class and uilt by a 

name o£ all 

speed OC number 

fuel in 

crew at 


COMPARISONS NOUNS Air BU LES 
smaller than shi <name 
greter than station <station 
equal to mission <class 
Saulvalent to weapons <weapons 
different from <number 


QUERY STATUS : List all ships of class DDG 





Figure 6.1 Menu-driven Natural Language Dialogue. 
Rich [Ref. 8], calls this approach "Window method" 


considering each screen presented to the user, as composed 


by windows of information. 
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In the screen of figure 6.1 three windows exist. The 
user starts from the window which is first activated by the 
system. After the selection of one of the available words, 
the next window is activated, in order for the user to 
proceed with the next selection. Two things are important 
after each selection: 

¢ The screen must be updated with respect to the progress 
of the dialogue so far. 

¢ The windows which contain options for the next steps 
will be modified to present only the options which are 
are necessary for the user to proceed further. 

The first action is related to the objective for 
adequate feedback to the user with respect to the progress 
of dialogue. The second , is related to the objective that 
the user must be provided with just the information he/she 
needs to continues =sewatatogue andmiomaing more. 

It is believed that in order for such a screen to 
address user's needs, it should “contain at least 3a 
following windows: 

¢ Command window: It should contain command options for 
completing the statement, for interrupting and/7on 
terminating the dialogue, help and control commands. 

¢ Word window: It must contain any word which is expected 
to be used in the composition of some statement, except 
commands. 

¢ Dialogue progress window: It should contain any infor- 
mation concerning progress of the dialogue so far, for 
example the part of the statement which has. already 
been created. 

The implementation of this approach requires the 
following components: 

¢ Lexicon or dictionary or vocabulary: It contains the 


words which have to be recognized by the system. 
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¢ Grammar: It contains the rules that govern the forma- 
tion of the statement, from the words which are 
presented on the screen. 

In this approach, lexical analysis is not complex. 
The reason is that every possible constituent of input 
already exists and is provided automatically by the system. 
Parsing is performed as user's statement is entered and 
includes the building of the parse and updating the window. 
Semantic processing is easy and is performed while the parse 
tree is being built up. 

A very important characteristic of this method is 
that any statement composed of words which are provided as 
options will be any way correct and ready for syntactic and 
semantic analysis and the possibility of rejection by the 
system exists. Another important characteristic is that 
the user is in some way restricted to move onto a small 
number of predefined paths, but this is the trade-off for 
the error-free dialogue. 

This approach is suitable for cases where the 
semantic complexity of the domain is low and the number of 


the options which have to be provided is small. 
2. User-initiated Dialogue with Simple Statements 


This approach is indicated when the number of state- 
ments which have to be constructed is large and the subset 
of natural language used is small. If the number of state- 
ments is large beyond the point that allows the system to 
Maintain the dialogue by itself, then the previous method is 
not indicated. 

In this approach the user constructs entire state- 
ments, which are then entered into the system. The system is 
assigned the responsibility of understanding each statement 
and implementing the required actions. The idea behind this 


approach is the combination of parsing and semantic analysis 
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in a single step. So, the understanding is accomplished in 


two steps. The method implements a kind of grammar which 
contains rules for both parsing and semantic analysis. Two 
requirements govern the design of these rules. First, the 


rules are designed in such a way that the statements which 
can be successfully parsed correspond closely to the actions 
of the target program. Second, the structure, which is 
assigned by the parser to the statement, also corresponds to 
the target program. A problem which may appear with this 
approach, is that some inputs, which do not correspond 
closely enough to the target program, may be accepted. This 
occurs if the rules of the grammar are not sufficiently 
constrained. Another problem is that a correct statement may 
not be parsed if it is not associated with any of the 
specific sets of actions of the target program. This is a 
result of the situation where the rules of the grammar are 
associated to some set of actions. An example of a parse 


tree generated by such rules is shown in Figure 6.2. 


SENTENCE 


sited PROPERTY 


what is re ance: 


speed 





Figure 6.2 Syntactic Parse Tree. 
In this example the categories "aircaft" and "type" 


are designed to correspond to the target set of actions 


which obviously concerns facts and actions about aircrafts. 
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Here the similarity between the structure of the query and 
the parse tree can be noticed, which makes the production of 
the query easy. 

This method is suitable when the subset of the 
natural language is small. It becomes inappropriate when 
this subset is so big that some of the characteristcs of the 
Bemguage must be used in the understanding process. As the 
rules are expanded to deal with larger pieces of language, 
they become so complex that the implementation of the method 


becomes infeasible. 
3. User-initiated Dialogue with Complex Statements 


When a large part of the natural language is used as 
interface it is necessary to capture as many characteristics 
of the language as possible in the rules used to understand 
ater, A grammar which satisfies such a requirement maps 
structures of statements onto structures of target program 
actions. When both the assignement of a structure to the 
statement and the translation of the structure to target 
actions are complex then the understanding must be accome- 
plished into two steps. First, rules which allow the parser 
to assign structures, which capture various characteristics 
of the language of the statement are used. Second, other 
rules are needed for the translation of the structures to 
actions. 

Ihe difference from the previous approach is that 
two separate sets of rules are needed for understanding the 
input which is performed in two separate steps. 

The problem with understanding of complex sentences, 
is that regardless of the parsing strategy the components 
which will finally be used are very often difficult to 
determine in advance. The final choice of these 
constituents is a search process which may make use of 
various techniques, like those described in the previous 


chapter, to solve the problem. 
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To minimize the number of intermediate constituents 
which may be at some stage rejected, most parsing rules are 
augmented with tests that must be satisfied in order for the 
rules to activate. Some of these checks test for syntactic 
properties, others for semantic and yet others for specific 
properties. 

There are two ways to perform the third step, the 
semantic analysis. The first way is to inter-leave syntactic 
and semantic processing and associate with each grammar rule 
a set of actions which will be applied whenever the rule is 
applied. There are two reasons why this technique is very 
often inappropriate: 

* The need for backtracking cannot be satisfied once some 
actions have already been associated with the rules. 

¢ The objective of interface transportability cannot be 
accomplished because the parser and the grammar are 
difficult to be transferred in many applications. 

A second technique separates completely the parsing 
from the semantic processing and satisfies both these 


requirements. 


B. INTERFACE ARCHITECTURE 


We have seen _§ so far, that the size of the natural 
language subset is avery important factor for the defini- 
tion of the design approach. We have seen also that not all 
of the components of a natural language understanding system 
are equally important to each design approach and that it 
is also possible that a combination of the components will 


exist. 
nee [Reegt eeu hequiremenies 


In our attempt to define an architecture for the 


natural language interface, we shall first look at’ the 
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requirements which that interface must satisfy. The minimum 
requirements which the interface should satisfy, are listed 
below: 

° Flexible parsing: User's input must be acceptable when 
Minor linguistic errors exist. In addition, the user 
must be provided with adequate feedback relative to the 
cause of the errors and even more he/she must be 
provided with recommendations about the alternatives 
available for continuing the dialogue. 

* Adequate feedback: The user must be informed as often 
as possible about the actual state of the dialogue and 
what the system has understood. 

* Intension recognition: The system must be able to 
distinguish the meaning of similar expressions used 
within different contexts. This is a very important 
requirement for text understanding systems. 

These minimum requirements have to be fulfilled in 
order for the interface to be acceptable, but in order for 
the interface to be graceful too, the following additional 
requirements must be satisfied: 

e Transportability: The interface must be in some extend 
"application-independent". 

ferodifiability: User must be allowed to modify the 
components of the interface for example the lexicon, 
rules, etc. This allows the addition of new entries 
(knowledge acquisition) and/for improvement of its 
features. 

A natural language interface which has the capabili- 
ties of satisfying these requirements, addresses most of the 
objectives of the man-machine interface. There are, though, 


some objectives which the natural language interface does 


not address. These are the reliability and transparency of 
the dialogue. These objectives are very difficult to be 
realized by natural language interfaces. The primary reason 
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is the ambiguity in the natural language which exposes the 
user to many errors (unreliable dialogue) and he/she also 
meets difficulties in predicting system's behavior (lack of 


transparency). 
Ze Interface Characteristics 


Before examining the tools, which a natural language 
interface must make use of and its functions as well, we 
must decide about the scope of the interface. Two options 
are available: 

¢ The interface will be of limited scope; this means that 
it will be used for implementing one of the approaches. 

¢ The interface will be of broad scope; this means that 
it will be used for implementing more than one of the 
approaches. 

The advantage of a general purpose interface is that 
it addresses a larger number of the user’s characteristics 
and information needs. There is, though, a trade-off 
between the breadth of scope and the complexity and scope of 
the interface. The broader is the scope, the higher is the 
complexity andthe cost. A restricted purpose interface 
addresses a smaller range of the user's characteristics and 
information needs, but favors lower complexity and cost. 

The interface which 1s proposed is a general purpose 
interface with the following important characteristics: 

e It combines all the three design approaches. 

e It provides the capability of modifying the content of 
system's components. 

e It is transportable. 

Several implications come out of these characteris- 
tics and they are listed below: 

¢ The interface may be used for system-initiated 
dialogue, simple statement understanding and complex 


statement/text understanding. 
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* User's conceptual view is separated from the details of 
the particular application. 

¢ Not all of the components will be important for any 
case of input understanding. 

A general overview of a transportable natural 
language interface for data retrieval 1s presented in 
[Ref. 4]. This interface is the basis for the general 
purpose interface presented in figure 6.3. 

The new features which have been added to the inter- 
face are the following: 

¢ Complete separation of the natural language portion 
from the application portion. 

¢* A means of cooperation between the natural language 
part and the application part. 

e¢ A means for extending the existing knowledge bases. 


¢ A means for extending or modifying the sets of rules. 


3. Description of the Proposed Interface 


The interface consists of two portions; the natural 
language portion and the translation portion. LneLtr joinie 
task 1s to process any natural language input, to understand 
it and generate a kind of code which represents the meaning 
of that input and consequently the interpretation of user's 
intentions. 

The user can select one of the following options as 
far as the type of dialogue is concerned: 

¢ System-initiated natural language dialogue. 

e User-entered simple statements. 

¢ User-entered complex statements and/or text. 
By specifying his/her option via menu selection, the 
dialogue controller activates the components which are 
necessary for the realization of the dialogue. Then, one of 
the three understanding approaches is applied. Given that 


text understanding is the most difficult case because of the 
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Figure 6.3 Diagram of the Natural Language Interface. 
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increased ambiguity, it was found appropriate that a special 
module for the disambiguation of the input would be 
extremely useful in solving the problem, by £ibingy, the 
product of semantic analysis with additional linguistic 
information and resolving various types of references. 

The next step includes the transformation of a 
complete interpretation of the input - to some kind of code 
(meaning representation code). This code is ready to be 
translated in the application translation module, into a 
representation which makes explicit reference to the charac- 
teristics and structure of the specific application. The 
translator, using facts from user's conceptual view and 
information about application characteristics like files, 
records, fields etc in case of database, takes the meaning 
representation code and produces an application-specific 
code. This code consists of various commands which are 
accepted by the application control system, for example the 


database management system. 
4. Tools 


several tools are employed at the various stages of 
processing the natural language input at the interface: 
° Application dictionary: It consists of words which are 


related to the application (domain specific). 


¢ General dictionary: It contains words which are not 
directly related Ee the application (domain 
independent). 

e Syntactic rules: Rules used for the syntactic analysis 
(parsing). 

e Semantic rules : Rules used for semantic analysis. 


* Context information: Information referenced in case 
- The semantic analyzer needs information about context 


to assign meaning to a specific statement. 
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- Complex or interrelated statements or text are 
entered 
e Discourse analysis rules: Rules which determine the way 
in which context in formation will be used. 

Another important module which was left to be 
described at the end because it is not directly retated with 
the dialogue, is the knowledge acquisition module. This 
module allows the user to extend the domain of the interface 
by creating new entries and rules or modifying the domain by 


changing entries and rules. 
5. Parsing and Understanding with Word Experts 


This approach makes use of a model of natural 
language understanding, called Word Expert Parser 
[Ref. 1O0:pp. 89]. Under this approach each word is treated 
as a ocomplex procedural knowledge source which contains 
decision paths, which reflect the range of knowledge about a 


specific word, necessary to understand that word in various 


contexts. The idea behind this approach is that a language 
can be understood through a controlled exchange of mean- 
ingful signals and concepts among internally complex 
experts. 


The Word Expert Parser (WEP), is organized in three 
levels: 
e Top : it represents the control environment where the 
C@XpDeCr esr un. 
¢ Middle :it consists of the protocol for the inter- 
expert communication. 
¢ Low : it concerns the intra-expert structure of the 
expert's decision logic. 
Considering the general model environment, there is 
one expert for each morpheme (syntacticaly or semanticaly 
meaningful lexical unit which cannot be further subdivided). 


During lexical analysis, each sentence is morphologicaly 
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analyzed to identify the relevant word experts which then 
are manipulated by the parser. Soueioleoe Parsing is first 
assigned to the leftmost expert which, thereafter, takes 
control of the model. The other words are read "on demand" 
by other experts or after request from the top level. 

The control environment, where the experts run, 
contains the following features: 

* memory context : results of understanding processes of 
previous sentences. 

* concepts : data objects which represent understanding 
thoughts in various stages of refinement and which 
become available to running experts when they do 
appropriate questions. 

* parser state : description of the current focus of the 
parser, as it is described by previous running experts. 

* signals : information about parser nature. 

¢ demons : mechanisms which watch computational results 
of the running experts and reactivate other experts 
when required or after a given period of time has been 


passed. 
a. Running a Word Expert 


Experts are activated in three cases: 
¢e Questions are addressed to them by other experts. 
e They must respond to newly created concepts or signals. 
¢ They become the next word to be read in the sentence. 
Four parameters characterize the reasons of 
expert awakening: 
e Reentry points : inform experts where decision logic 


has to be restarted. 


e Signals : indicate the nature of expert awakening. 
e Concepts : accompany various experts. 
e Reference : reference to another expert if awakening 


has occurred in response to a prior request. 
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b. Word Expert behavior 


The behavior of aword expert is expressed 
through primitives which establish the range of that 
behavior. Several such primitives [Ref. 10:pp. 97], are 
suggested: 

¢ Request for the next word of the sentence which must be 
read. 

¢ Information request from other experts. 

¢ New information reporting. 

¢ Building new concepts or contributing to existing 
concepts. 

¢ Information requests about concepts. 

e Suspending running experts to await future reawakening 
by time-out demons. 

e Terminating expert running after identifying word's 
role in the sentence. 

These primitives are organized in two sets; the 
Lexical Interaction Language (LIL) which contains primitives 
for inter-expert communication and the Sensed Discrimination 
Language (SDL) which contains experts concerning expert's 
decision logic. Both sets of primitives and their classifi- 


cation are illustrated on Table II. 


c. Sentence understanding with Word Expert Parsing 
ines Dake, the implementation of the WEP 

approach on the sentence "A ship carrying missiles", is 
provided. The entire process is accomplished in 21 steps as 


it is described below: 
* The parser reads the word "A", retrieves its relative 
expert from the memory and starts running it. 
¢ The expert of word "A" is being prepared to accept the 
upcoming concept (ship). 


e Initial construction of the "ship" expert. 
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TABLE II 


Word Expert Behavior Primitives 


LEXICAL INTERACTION LANGUAGE PRIMITIVES 


Posting Restarting Demons | 
AWAIT CONCEPT, AWAIT SIGNAL, AWAIT WORD 


Look-Ahead Functions 
PEEKW, READW 

Message Sending 
SIGNAL, REPORT 


SENSE DISCRIMINATION LANGUAGE PRIMITIVES 
banguastic Growp Structuring 
LITERAL, OPENG, DECLAREG, BREAKG, LINK, CLOSEG, IDIOM 


Concept Building and Augmenting 
SUILDC, REEINEC, ROLEC, ASPECTC, STOREC 


Control Flow 
NEXT, CONTINUE, PAUSE, ALIAS 


Memory Interaction 
VIEW, BINDC 


* Running of the suspended expert "A" is resumed, after 
the construction of the concept which refers to the 
expert “ship" has been reported. 


¢ Initial execution of the “ing" expert. 


ial 


The "ing" expert is being prepared to accept a response 
about the main verb (carry). 

Initial execution of the "carry" expert which then 
branches to an action node in order to be tied with the 
"ing" expert. 

Execution of the "carry" is resumed. 

Word expert for "ing" resumes its third execution, to 
give the "ing" expert the name of the structure being 
built by carmus 

The "s" expert initiates a new sequence. 

The "ing" expert branches to a node to wait for the 
upcoming concept. 


The "s" expert continues awaiting the same concept 


awaited by “ing 
The "missile" expert discriminates among possible word 
senses. 

The "s" expert opens new lexical group changing state 
of the parser Eo “action construction. 

The "missile" expert resumes execution reporting the 
concept structure it has created initially. 

The "s" expert resumes execution using the desired 
concept as input. 

The "ing" expert resumes execution to sort out the 
meaning of the text fragment for which it has coordi- 
nated the parsing process. 

The "carry" expert resumes execution to process’ the 
concept structure sent by the "ing" expert. 

The "carry" expert resumes execution refining the 
meaning of its action. 

The "ing" expert resumes for final execution. The 
carrying action and its object are stored in the active 
memory. 


Execution of the expert, announcing the end of the 


sentence and completion of analysis. 


eZ 


EXPERT1 
ENTRY INITENTRY 
EXPERT : *WEP* 
INI TENTRY 
ENTRYO 
NO/A OPENG 
DECLAREG 
EXPERT1 
ENTRY : ENTRY1 
EXPERT : *WEP* 
ENTRY1 
NO/A AWAIT 
EXPERT4 
ENTRY : INITENTRY 
EXPERT : EXPERT1 
INITENTRY 
ENTRYO 
NO/Q SIGNAL 
N3/A DECLAREG 
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na 


SIGNAL “BREAK * 
CONCEPT NIL 
SENDING 
*ENTITY=-CONSTRUCTION* 
SIGNAL 


POSTING SIGNAL DEMON 
RESUME ON FIRING AT 
ENTRY 1 


os 


"PAUSE. 
NIL 


SIGNAL : 
CONCEPT : 


POSTING CONCEPT DEMON 
RESUME ON FIRING AT 
ENTRY 2 

RESUME ON EXPIRATION 
ALT’: ENTRY3 


ft Sie 


SIGNAL 
at itr eONSs TRUCTION* 
CONCEPT NIL 


BRANCHING ON 
pmive lary -CONSTRUCTION* 


CONTINUE 


ENTRY1 
NO/A CREATEC CREATING C#CONCEPT?7 : 
ENTITY 
REE INEC REFINING C#CONCEPT?7: 
C#L#SHIP 
REF INEC REFINING C#CONCEPT?7: 
C#DDG=-SSBN 
LINK 
CLOSEG SENDING 
*COMPLETE=ENTITY* 
SIGNAL 
N1/A REPORT REPORTING CHCONCEPT7 
EXPERT1 "A" 
ENTRY "=; BN iRyvz SIGNAL : NIL 
EXPERT : EXPERT4 CONCEPT :CHCONCEPTS 
ENTRY 2 
NO/A BINDC BINDING IN 
FOCUS AREA FAILS 
N2/Q BOUND CONCEPT2 VARIABLE 
UNBOUND 
N4/A REPORT C#CONCEPT7 
EXPERTS "ing" 
ENTRY : INITENTRY SIGNAL 
*COMPLETE-ENTITY* 
EXPERT : EXPERT4 CONCEPT : NIL 
INITENTRY 
ENTRYO 
NO/Q SIGNAL BRANCHING ON 


-COMPEETE ENT iy ~* 
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N3/A OPENG 
DECLAREG 
EXPERTS 
ENTRY : ENTRY1 
EXPERT : *WEP* 
ENTRY1 
NO/A BUILDC 
BINDC 
READW 
SIGNAL 
AWAIT 
EXPERTS 
ENTRY : INITENTRY 
EXPERT : EXPERTS 
INI TENTRY 
ENTRYO 
NO/Q SIGNAL 
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SENDING 

*ACTION-CONSTRUCTION* 

SIGNAL 

POSTING SIGNAL DEMON 
RESUME ON FIRING AT: 

ENTRY1 


ing 
SIGNAL *PAUSE 
CONCEPT NIL 


CREATING C#CONCEPT12: 
ENTITY 
CHCONCEPT12ROLE: 
AGENT 

BINDING IN 

ACTIVE AREA SUCCEEDS 


SENDING *ING* SIGNAL 
JUO) Ee S327 ERY SM be, 

POSTING SIGNAL DEMON 
RESUME ON FIRING AT: 
ENTRY 2 

RESUME ON EXPIRATION 
Aa ENTRYS 


"carry" 
SIGNAL "ING" 
CONCEPT NIL 


BRANCHING ON "ING" 


N4/A BINDINPUT 
DECLAREG 
ALIAS 
CONTINUE 

ENTRY1 

NO/A CREATEC 


REF INEC 


REF INEC 


LINK 


CLOSEG 


PAUSE 


CREATING C#CONCEPT?7: 
ACTION 

REFINING C#CONCEPT?7: 
CHL#CARRY 

REFINING C#CONCEPT?7: 
C#CARRY 


SENDING 
*COMPLETE-ACTION* 
SIGNAL 

POSTING SIGNAL DEMON 
RESUME ON FIRING AT: 
ENTRY4 


EXPERTS "carry" 


ENTRY : ENTRY4 
EXPER(ae Wick = 


ENTRY7 
NO/A SIGNAL 


AWAIT 


AWAIT 
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SIGNAL : *PAUSE* 
CONCEPT : NIL 


SENDING "ING" SIGNAL 
TO: EXPERTS 

WITH: CH#CONCEPT?7 
POSTING SIGNAL DEMON 
RESUME ON FIRING AT: 
ENTRY3 

NO RESUMPTION ON 
EXPIRATION 

POSTING SIGNAL DEMON 
RESUME ON FIRING AT: 


EXPERTS 
ENTRY : ENTRY2 
EXPERT : EXPERT11 
ENTRY 2 
NO/A AWAIT 
EXPERT21 
ENTRY : INITENTRY 
EXPERT : EXPERTS 
INITENTRY 
ENTRYO 
NO/Q SIGNAL 
N2/A OPENG 
DECLAREG 
PAUSE 
EXPERTS 


Gel) 


ing 


ENTRYS5 
NO RESUMPTION ON 


EXPIRATION 

ing 

SIGNAL "TING" 
CONCEPT C#CONCEPT7 


POSTING SIGNAL DEMON 
RESUME ON FIRING AT: 
ENTRY4 

NO RESUMPTION ON 
EXPIRATION 


s" 


SIGNAL 
*COMPLETE-ACTION* 
CONCEPT : NIL 


BRANCHING ON 
*COMPLETE=-ACTION* 
SENDING 
*ENTITY-CONSTRUCTION* 
SIGNAL 


POSTING SIGNAL DEMON 


RESUME ON FIRING AT: 
ENERY 2 


ENTRY : ENTRY4 


EAPERD > "EXEERee 


ENTRY4 
NO/Q SIGNAL 
N1/A AWAIT 


EXPERT21 
ENTRY : ENTRY2 
EXPERT : *WEP* 
ENTRY 2 
NO/A AWAIT 
EXPERT22 


ENTRY : INITENTRY 


EXPERT : EXPERDZi 


INITENTRY 
ENTRYO 
NO/Q SIGNAL 
N3/A CREATEC 


fae: 


SIGNAL 
*ENTITY -CONSTRUCTION* 
CONCEPT : NIL 


BRANCHING ON 
*ENTITY-CONSTRUCTION* 
POSTING CONCEPT DEMON 
RESUME ON FIRING AT: 
ENTRY5 

NO RESUMPTION ON 
EXPIRATION 


og" 


SIGNAL : PAUSE 
COnGEPI : NIL 


POSTING CONCEPT DEMON 
RESUME ON FIRING AT: 
ENTRY 1 

NO RESUMPTION ON 
EXPIRATION 


NTS e., 


SIGNAL: 
*F ENTITY -CONSTRUCTION* 
CONCEPT > NG: 


BRANCHING ON 
*FENTITY-CONSTRUCTION* 
CREA LING: 


REF INEC 


DECLAREG 


LINK 


CLOSEG 


BUILDC 


AWAIT 


EXPERT30O 


ENTRY : INITENTRY 


EXPERT : EXPERT23 


INITENTRY 
ENTRYO 
NO/Q SIGNAL 
N4/A OPENG 


DECLAREG 


EXMEPERT2Z2Z 


ENTRY : EMERY 3 
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CHCONCEPT26: ENTITY 
REFINING: 
CHCONCEPT26: 
C#L#SHIP 


SENDING 
*COMPLEX=-ENT Tie 
SIGNAL TO: EXPERT30 
CREATING 

CHCONCEPT38 ROLE: NIL 
POSTING CONCEPT DEMON 
RESUME ON FIRING AT: 
ENTRY 2 

RESUME ON EXPIRATION 
Me ENTRYS 


gs" 


SIGNAL 
*COMPLEX=ENTITY* 
CONGEPT : NIL 


BRANCHING ON 
*COMPLEX=-ENTITY* 
SENDING 
*ACTION-CONSTRUCTION* 
SIGNAL 


tt ship" 


SIGNAL : *ELSE* 


EXPERT : @WwEP* 


ENTRY3 
NO/A REPORT 


EXPERT21 


ENTRY: SENTRY 
EXPERT ©: SBAPERE ce 


ENTRY1 
NO/A REFINEC 
REPORT 
EXPERTS 


ENTRY : ENTRYS 
EXPERT (2 eee eee 


ENTRY5 
NO/A BUILDC 


BU ILDC 


ASPECTC 
Robe. C 
BUILDC 


BINDC 
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ing 


CONCEPT : NIL 


REPORTING CH#CONCEPT26 


eg! 


SIGNAL : NIL 
CONCEPT : C#CONCEPT26 


REFINING C#CONCEPT26 
REPORTING C#CONCEPT26 


SIGNAL : NIL 
CONCEPT : C#CONCEPT26 


CREATING C#CONCEPT33: 
ACTION 
C#CONCEPT33 ROLE: 
NIL 

CREATING C#CONCEPT34: 
ACTION 
CHCONCEPT34 ROLE: 
NIL 


CREATING C#CONCEPT35: 
ACTION 

CHCONCEPT35 ROLE: 
NIL 

BINDING IN PLAUSIBLE 
AREA SUCCEEDS 


CONCEPT6 VARIABLE 
BOUND TO 

CHCONCEPT33 

CREATING C#CONCEPT36: 
Ber cy 

CHCONCEPT 36 ROLE: 
Nii 

CREATING C#CONCEPT37: 
ENGI ryY 

C#HCONCEPT37 ROLE: 

NIL 

BINDING TO ASPECT OF 
CHCONCEPT33 SUCCEEDS 
BINDING TO ASPECT OF 
CHCONCEPT33 SUCCEEDS 
SENDING *AGENT* SIGNAL 
LO: SEXPERTS 

WITH: C#CONCEPTS 
SENDING *OBJECT* SIGNAL 
TO: EXPERTS 

WITH: C#CONCEPT26 
POSTING CONCEPT DEMON 
RESUME ON FIRING AT: 
ENTRY6 

NO RESUMPTION OF 
EXPIRATION 


EAPERITO Garry 


N1/9Q BOUNDC 
N2/A BUILDC 
BUILDC 
BINDC 
BINDC 
SIGNAL 
SIGNAL 
AWAIT 
ENTRY : ENTRY3 
EXPERT : EXPERT10 
ENTRY3 
NO/A ASPECTC 
ROLEC 
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so IGNAL 
CONCEPT 


AGENT 
CHCONCEPTS 


CHCONCEPTS ROLE: 


AGENT 

EXAMINING CONCEPT: 
CHCONCEPTS 

POSSIBLE VIEWS: 
CHPERSON 

CHOBJECT 

C#SYSTEM 

CHOICE: C#SYSTEM 
REFINING C#CONCEPT13: 


EXPERTS "carry" 


N1/Q VIEW 
N4/A REFINEC 
ENTRY : ENTRYS 
EXPERT : EXPERTS 
ENTRY5 
NO/A ASPECTC 
ROLEC 
N1/0Q VIEW 
N2/A REFINEC 
REPORT 
EXPERTS 
ENTRY : ENTRY6 
EXPERT : EXPERT9 
ENTRY6 
NO/A STOREC 


SZ 


ing 


SIGNAL 
CONCEPT 


OBJECT 
CH#CONCEPT26 


CHCONCEPT28 ROLE: 
OBJECT 

EXAMINING CONCEPT: 
CHCONCEPT26 

POSSIBLE VIEWS: 
CHANYTHING 

CHWEAPONS 

C#PEOPLE 

CHOICE: CH#WEAPONS 
REFINING C#CONCEPT13 
C#CARRY =-WEAPONS 
REPORTING C#CONCEPT13 


SIGNAL : NIL 
CONCEPT CHCONCEPT13 


STOREC 
REPORT REPORTING C#CONCEPTY 


EXPERT49 "*per*" 


ENTRY : INITENTRY SIGNAL 
*COMPLETE<-ENTITY* 
EXPERT : EXPERT2Z2 CONCEPT : NIL 
INITENTRY 
ENTRYO 
NO/A BREAKG SENDING 


*BREAK* SIGNAL 


Meee interface Operation Algorithms 


Of great interest for the implementation of the 
natural language interface are the algorithms which describe 
the function of the natural language understanding portion. 
That portion will be represented in two levels of detail; 


the organization level and the functional level. The first 





concerns the organization of the interface to implement the 
three approaches and the second the components and functions 
of the system, in each case. 

Table III shows an algorithm written in pseudocode, 
describing the function of the interface at the organization 
level. In this algorithm the following modules are used to 
represent the three approaches: 

° Menu-driven dialogue module (MDD). 
* Single sentence dialogue (SSD). 
° Complex sentence dialogue (CSD). 

Menu-driven dialogue is described in algorithm of 
Table IV and flowchart of Figure 6.4 . The following modules 
are being used: 


¢e UPD_STATUS : updates status of dialogue. 
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TABLE IIT 


Interface Organization Algorithm 


ALGORITHM NLI_ORGANIZATION 


Begin 
read selection 
if selection = MMD then 
pega ; 
parser = active 
semantic analyser = active 
do MMD 
else ult selecuron SSD 
beqing 
exical analyzer = active 
parser = active 
semantic analyzer = active 
do SSD 
else if selection = CSD 
begin. 
exical analyzer = active 
parser = active 
semantic ane eS = active 
discourse analyzer = active 
do €s®» 


else 
do E_INTERRUPT 
end 
end if 
end if 
END 





* SYN SEMe@ANALY Solo: performs syntactic and semantic 
analysis. 

e E_INTERRUPT : interrupts and notifies user if input is 
invalid. 


¢ CD_GENERATION : generates meaning representation code. 
Single statement dialogue is described in the algo- 
rithm of Table V and flowchart of Figure 6.5 and includes 
the following: 
e LEX_ANALYSIS : performs lexical analysis. 
°e SYN/SEM_ANALYSIS : performs syntactic and semantic 


analysis. 
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TABLE IV 


Menu-driven Dialogue Algorithm 


ALGORITHM MENU_DRIVEN_DIALOGUE 


Be 
eoice "ENTER SENTENCE?" 
if answer = yes then 


do tie 


a" Ez INTERRUPT 
end if 
End 


MDD 


fee not_end_of_sentence do 
egin 

write "ENTER OPTION" 

read option 

mm Option = legal then 


be 
qe” UPD_STATUS 
do Soe ANALYSIS 


els 
Ee T _INTERRUPT 
end if 
end while 





SSoON/SM_INTERRUPT =: interrupts when syntactic and/or 
semantic analysis is incomplete. 

fea INTERRUPT =: interrupts when lexical analysis 1s 
incomplete. 


* U_INTERRUPT : interrupts after user's desire. 
° CD_GENERATION : generates meaning representation code. 
Complex statement dialogue is described by the algo- 
rithm in Table VI and flowchart of Figure 6.6 which include 
the following: 
e LEX_ANALYSIS : performs lexical analysis. 
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Figure 6.4 


° SrYNSANZE Ss 
© SEM_ANAB SES 
°* DIS_ANALYSIS 
° UPDSCON Th Ss 
cL INGERRUES 


incomplete. 






SYN TACTIC/ 
[SEMANTIC 
AMUALYSIS 


Menu-driven Dialogue Flowchart. 


performs syntactic analysis. 
performs semantic analysis. 
performs discourse analysis. 
updates context information. 


interrupts when lexical analysis is 
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TABLE V 
Single Statement Dialogue Algorithm 


ALGORITHM SINGLE_STATEMENT_DIALOGUE 


Begin , 
write “ENTER SENTENCE ? 
read answer 

if answer = yes then 


__" 
o SSD 
else 
go U_INTERRUPT 
end if 
End 


SSD 


begin 
ae LEX_ANALYSIS 
if LEX_ANALYSIS = complete 


a 
lo SYN/SEM_ANALYSIS 
ae SYN/SEM_ANALYSIS = complete then 
egin 
write "PROCESSING COMPLETE" 
do CD_GENERATE 


else 
eee SUPT 


end i 
end if 


else 
do L_INTERRUPT 
end 





mon INTERRUPT : interrupts when syntactic analysis is 
incomplete. 

wee INIERRUPT : interrupts when semantic analysis is 
incomplete. 

Sew INIERRUPT : interrupts when discourse analysis is 
incomplete. 


* U_LINTERRUPT : interrupts after user's desire. 
° CD_ GENERATION : generates meaning representation code. 
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ANALYSIS 


L. INTERRUPT 


GENEROTION 


Figure 6.5 Single Statement Dialogue Flowchart. 
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TABLE VI 
Complex Statement Dialogue Algorithm 


ALGORITHM COMPLEX_STATEMENT_DIALOGUE 


Be 

fete "ENTER STATEMENT ?" 
read answer 

if answer = yes then 


do" ze» 


iS “o INTERRUPT 
end if 
End 


CSD 


begin 
do LEX_ANALYSIS 
if LEX_ANALYSIS = complete then 


be 
aon SYN_ANALYSIS 
if SYN HANALYSIS = complete then 


be 
a3" SEM_ANALYSIS 
ft Sai _ANALYSIS = complete then 


be 
a5" DIS_ANALYSIS 
if DIS_ANALYSIS = complete then 
b 
egin 
write "PROCESSING COMPLET" 
do UPD_CONTEXT 
do GENERATE 


else 
do D_INTERRUPT 
een ack 


els 
x5 SM_ INTERRUPT 
end if 
else 
do SN_INTERRUPT 
a. if 


els 
ae T_ INTERRUPT 
end if 


end 
kek ke k €& 


Sg 





i o> 
yes 


yas 


SEMANTIC 
ANALYSIS 


f Cy? 


D.INTERRUPT GENERATION 


yes 
OISCOURSA STATUS 
ANALYSIS DATING : 


Figure 6.6 Complex Statement Dialogue Flowchart. 
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VII. THE PRACTICALITY OF NATURAL LANGUAGES 

The purpose of this chapter is to examine the 
practicality of natural languages. We associate the term 
"Dractical" with anything that serves. a purpose. In this 
context, the practicality of natural languages can be 
defined as the degree to which they serve the purpose of 
effective man=-machine communication. 

The practicality of natural languages can be examined in 
four application areas: 

e Database querying. 

e Natural language programming. 

e Natural language dialogues. 

e Text knowledge systems. 

The idea of the approach is to identify the needs of the 
users in each application area and then to examine how well 


the natural languages serve these needs. 


A. DATABASE QUERYING 


The method which best serves the user is the menu-driven 
dialogue. The reason is , that although this method is 
somehow restrictive for the user, it does provide the big 
advantage that input statements are always within the 
semantic boundaries of the system. This approach addresses 
very well the needs of the user for meaningful feedback, 
minimum mental load and minimum stress for errors. The same 
method becomes too cumbersome if the natural language subset 
is large. 

Another issue in data retrieval applications concerns 
answers to queries. In this area the natural language 


greatly contributes in the presentation of the output in a 
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form which satisfies user's needs for readability and under- 


standability. A requirement which very often arises. and 
which, if it is not satisfied, may lead to erroneous 
answers, is that the system must take into account the 


intentions of the user when he/she does the question and 
respond appropriately by adapting its answer to user's 


intention. For example, in a question like 

Is there any record with Beethoven's 9th symphony? 
the system may respond 

Yes 


In this case it is possible that the customer will ask about 
the place where he/she expects to find the record. The 


system could in this case respond like 
Yes, shelf #5 in row A 


predicting that in a case where the record was available the 
customer would have asked about the place where the record 
1s. 

Another issue in taking answers in natural language is 
the amount of redundancy built into the answer. There are 
cases where too much information expressed in natural 
language hides the important points of the answer and makes 


the substance difficult to be distinguished. 


B. NATURAL LANGUAGE PROGRAMMING 


The difference between natural language programming and 
natural language querying systems is that the first has a 
broader scope; it has the task of translating indefinitely 
large subsets of natural language into full-scale program- 
ming languages and it is very difficult to limit the English 


subset. 
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The role of the programming languages is to provide a 
vehicle for the user to specify his/her instructions to the 
computer, in order for the computer to perform a given task. 
These instructions represent the user's intenstons . Of 
course natural language which every user knows well, is very 
G@eed at this point. 

The use of natural language as programming tool implies 
that the compiler will have to carry the load for recog- 
nizing the intenstons of the user expressed through the set 
of the instructions. This load becomes even heavier when 
user specifies his intentions indirectly. Natural language 
for example makes use of pronouns, noun phrases and ellip- 
tical references very often and any language subset which 
would not include such parts in its structure would not be 
too much practical in recognizing user's intentions. 

Several experimental systems have been reported. They 
allow a user to write programs in a restricted subset of 
English language. A recent example, [Ref. 11], showed that 
sophomore level programming students were able to program 
array-manipulation problems faster and with fewer mistakes 
in English than in PL/C. 

A person who writes programs in subsets of English 
language, feels sufficient freedom for expressing his/her 
instructions. The problem is that the same difficulty which 
makes recognizing user's intentions so difficult is the 
reason which makes providing feedback difficult. This point 
must become the main focus of the research if it is desir=- 
able for the natural language to survive as a programming 
language. 

There are two areas where the use of natural language 
seems practical. The first concerns the text knowledge 
systems and it is in some way illustrated by THEIRESIAS 
Peet. 12:pp. 87], a computer based consultation system 


intended to supply near-expert level advice on difficult 
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cognitive problems. The second concerns the use of a grammar 
to map procedures into English descriptions as an aid to 
documentation. Natural language gives the capability for 
bidirectional translation (source-to-target and target-to- 


source) through the use of a symmetric grammar [Ref. 13]. 


C. NATURAL LANGUAGE DIALOGUES 


Most systems in this area are in their infancy. The 
reason is that when people communicate between them, they 
leave many intentions implicit. In any case of human 
dialogue, theories of speech, acts and intentions come into 
play and and these theories are the focus of "mose 
experiments concerning natural dialogue systems. All these 
theories have to deal with problems concerned about what is 
intended rather than what is stated. This implies that some 
means which allow for tracking user's attention during the 
dialogue, have to be used. The trouble with this case is 
that user knows what topic has been chosen but the system 
must guess from the next sentence, what the topic of the 


preceeding sentence was. 


D. TEXT KNOWLEDGE SYSTEMS 


This area includes applications which concern automatic 
translation of natural language text into knowledge data- 
bases. The primary issue in this area is the same; to 
recognize the intentions of text's authors. 

Texts usually include dialogues, narrations and descrip- 
tions. They are written by each author for a certain 
purpose. Each text has built in the writing style of its 
author. Good writing styles are in general characterized by 
minimum redundancy, co-references and elliptic descriptions 


of events. 
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The intentions of dialogue participants and authors of 
narrations and descriptions are not usually reported in the 
text; they are left to be inferred by the reader. This is 
exactly the reason why the use of natural language for the 
creation of text knowledge systems is rather impractical at 
the present. 

Recognizing the intentions of the authors) and dialogue 
Peeticipants requires the use of discourse analysis theories 
and computational techniques which are still in the experi- 
mental stage and they must still be further developed 
before the application of natural language for the produc- 
tion of text knowledge systems become practical. When this 
moment arrives the user is expected to have access’ to any 
portion of the text, answers to questions and summaries of 


various text segments. 
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VIII. APPLICATIONS OF NATURAL LANGUAGES IN THE MILITARY 


mr sr 


ENV IRONMENT 


In the early 1970's the United States Department of 
Defense noticed a trend in rising software costs in major 
systems. The greater percentage , about 56%, of these costs 
were associated with the embedded computer systems while the 
other data processing activities took only 19% [Ref. 14]. 

There were several language-related reasons which 
contributed to this problem: 

* Use of languages which did not adequately support the 
existing applications. 

¢ Use of languages which did not support modern programe- 
ming methodologies. 

* Use of languages which failed as software development 
tools. | 

°* Use of languages which failed in dealing with the 
complexity of the required solutions. 

Because the majority of software costs in DoD were asso- 
ciated with embedded computer systems, the DoD turned its 
attention to that particular area. Embedded computer systems 
are minor systems within larger computer systems which are 
not primarily computational, such as weapon and control 
SYSTEMS. 

Embedded applications have some common characteristics: 

¢ They are usually large. 
¢ They are highly reliable. 
¢ They are physically constrained in relation to hardware 


address space and speed. 


¢ They have particular programming requirements for 
parallel processing, real-time control and exception 
handling. 
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¢e Highly experienced personnel. 

Although the technology had been improved to a point 
where very fast hardware and efficient compilers had been 
developed and promised efficient translation of programming 
languages, this had not made possible the use of the 
existing programming languages in embedded systems. At this 
point, the High Order Language Working Group was established 
which was assigned the responsibility of determining the 
requirements for DoD's' languages. Conducting a review 
throughout the military departments, federal agencies and 
other groups they came out with the result that most groups 
desired a language which should satisfy requirements like: 

seiagh precision 

¢ Pinpoint accuracy 

* Modern programming methodologies support. 
¢e Exception handling support 

e Very high reliability. 

These requirements led the DoD to sponsor the develop- 
ment of the ADA programming and design language. This 
language was assigned the responsibility to express program- 
ming and design solutions . 

Another problem, which is common within the data 
processing area, appeared in 1980 and is expected to affect 


DoD's data processing activities, concerns the availability 


of experienced personnel. This problem is faced by almost 
every organization today private, public, profit oor non- 
Profit . This problem is expected to become worse in the 
next decade. some reasons for this shortage in data 
processing personnel have been identified [Ref. 15] as 
follows: 


°e Reduced availability of professional data processing 
education in the USA. 
* Demand for users far exceeds existing availability. 


° High personnel turnover. 
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As an organization with a high degree of data processing 
activities, DoD is expected to "suffer" in both the embedded 
systems applications area and the transaction processing, 
the second biggest applications area. Iwo questions which 
arise at this point are: 

°* To what degree can user-friendly languages, and in 
particular natural language, contribute to solving the 
two prementioned problems which DoD faces? 

* Does the natural language merit any attempt of imple- 
mentation within DoD? 

Comparing embedded systems characteristics and require- 
ments to the capabilities of natural languages as they have 
been developed to this point we can easily see that 

¢ Natural languages do very poor in fulfilling embedded 
systems requirements, if they do at all. 

° They are of trivial importance as means of eliminating 
the problem of personnel availability. The reason is 
that embedded systems require highly experienced 
personnel, which has no need of using natural language 
and even more it will probably try to avoid it . 


These reasons are considered adequate to show that the 
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The other application area within the DoD concerns 
transaction processing. These applications are not operated 
or developed in such a_ strict and formal environment as 
embedded systems are. They have some common characteris= 
tics ~ including. 

* Fairly general specifications. 

e Short response time. 

°e Accuracy. 

¢ Moderate complexity. 

e Personnel of moderate experience. 


°° Precision. 
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Looking at these characteristics we see that there are 
some possibilities for implementing natural language appli- 
Cations but they are very limited. Pitsmaes aatesllt Of the 
fact that the factors of response time, accuracy and preci- 
Sion are still crucial and natural language applications 
cannot satisfy these requirements. With the progress 
performed so far in natural language applications, some hope 
exists for using natural languages for data retrieval appli- 
Ganwons . This is justified by the fact that there always 
will be possibility for having personnel who, at the begin- 
ning of their careers will desire a natural language 
dialogue with the system. 

There is, though, an area where natural language is 
expected to solve some problems. This area concerns training 
of people who have no data processing experience but they 
meee be used in various positions in computer systems. 
These people must be in some way educated in handling the 
computer. Every novice user in front of the computer feels 
quite uncomfortable and frustrated. He/she feels that 
he/she is facing a system which is unfriendly and that 
he/she will probably never be able to dominate it. He/she 
also feels that he/she has to use the system to perform a 
task under certain constraints and the system does not care 
if he/she has any problem in giving instructions to it. 

Natural language communication is expected to greately 
contribute to eliminating such problems. > Cake ingeaawach 
Simple queries in natural language for data retrieval, users 
will be able to overcome the frustration and stress they are 
feeling in front of the computer and gradually pass’ on to 
dialogues with limited natural language input, ending up 
after some time with the use of languages like SQL or 
GEmenil. 

As far as programmers are concerned, natural language is 


not indicated at all, even for educational purposes. 
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Natural language is still far from the point of usability a 
programming language. so it is better for these people to 
start their data processing education with one of the 
existing programming languages which best fits to the 


requirements of their upcoming jobs. 
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IX. CONCLUSION 


This research has focused in issues related to the 
design of Man-Machine Interfaces and particularly, abt 
concentrated on Natural Language Interfaces. 

The design objectives have been identified as follows: 

¢ Ease of learning 

¢ Ease of use 

¢ Flexibility with relation to user and task characteris- 
mcs 

e Transparency 

¢ Reliability 

As a possible solution for most of the problems in man- 
machine interaction, the use of natural language is seen as 
the means for communication of user ideas to the computer. 
Although natural language had been proved excellent for 
communication among humans, this does not happen in most 
cases of communication between human and compuer. The only 
application area where significant progress has been accom- 
plished, is the data retrieval applications. In the other 
areas, like Natural Language Programming, Natural Language 
Dialogue and Text Knowledged Systems, some progress has been 
made but, almost everything is in the experimental stage. 

Based on the general objectives of the man-machine 
interfaces and approaches to the combination of the compo- 
nents of a natural language understanding system, a general 
purpose natural language interface has been proposed. This 
interface is expected to be suitable for all three types of 
natural language communication tasks: 

¢ Simple statement understanding 

¢ Simple interrelated statements understanding 


e Text and dialogue understanding 


Od 


In order for the interface to address the objectives of 
mMan-machine communication and, even more, to satisfy the 
requirement for transportability, it was found appropriate 
that the natural language understanding portion will be 
separate from the application specific part. 

It is recommended that emphasis should be given in two 
Peomtirs: 

¢ Improving the understanding capabilities of the natural 
language portion of the system. 

¢ Creating translators which will effectively translate 
the meaning representation code according to the 


requirements of each particular application. 
Military organizations have to some degree similar 


processing needs as any other organization and they should 
be expected to suffer from the lack of competent data 
processing personnel during the upcoming decades. The 
contribution of natural language applications to the problem 
of having people with no data processing experience working 
at various data processing jobs is not expected to be 
significant in the immediate future. The only practical 
application within the military environment and with “eae 
progress performed so far in natural language understanding 
systems, is training of novice personnel in the begining of 
their careers and switching them to whichever language will 
be found appropriate when these people acquire some experi- 


ence in handling the computer. 


102 


APPENDIX A 
AN OVERVIEW OF THE LADDER SYSTEM 


LADDER (Language Access to Distributed Data with Error 
Recovery) is a computer system which provides answers to 
queries entered through terminals ina subset of natural 
language concerning a database with information about ships. 
The system accepts questions about the data expressed in 
natural language, and creates sequences of queries to the 
database management system (DBMS), determines which machines 
will process the queries, links those machines over the 
ARPANET, controls processing and prepares answers . to the 
questions. 

The more important characteristics of the LADDER, are 
shown in the Table VII 

LADDER is currently installed on three hosts on the 
ARPANET; SRI-KL at Stanford Research Institute and ISIB and 
ISID at the Informations Science Institute at the University 
of Southern California. It has been operational since June 
1976 and undergoes continous improvement. 

A demonstration of interaction between the user and the 
LADDER, is shown in Figure A.1l 
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TABLE. VEL 
Important Characteristics of the LADDER 
Dialogue is controllable by either the user or 
the system. 


Input in any combination of upper and lower 
case letters is acceptable. 


Ships can be specified by 


Questions may include simple and/or complex 
restrictions. 


User is allowed to extend the grammar 
dynamicaly and handle new types of questions. 


User is allowed to modify question format. 


User is allowed Uo) replacemmouseaucnice sonora 
questions with a single new question. 


Segments of sentences can be interpreted in 
the context of the previous question(s). 


User can control the amount of output. 
User can control the "amount" of interaction 


by changing the value of the verbosity 
parameter. 





104 


LADDER: 


USER 
USER 


LADDER: 


USER 


LADDER: 


USER 


LADDER: 


USER 


LADDER: 


USER 


LADDER: 


USER 


LADDER: 


Figure A.1 


Do you want instructions? 
no 


use dbms20 
PARSED! 


set verbosity to be =2 
PARSED! 


aeveot the destination of Kennedy and Whale 


ARS 

Give a Gaeocllatton Of Ship »S where 
either the name of the ship $ is Kennedy 
ship S is WHALE 


or the name of the 
DEST = NORFOLK, NAPLES 


how long will it take 
to arrive in Norfolk? 


for the Wainwrite 


Ellipsis has failed. 

LADDER does not expect the word ron Ke) 

follow "HOW LONG WILL IT TAKE 

cEnTes® for this word or symbol are: 
HIP>, FOR 


Tryin Ellipsis: 


Wainwrite 


how long will it take for the 
to arrive in Norfolk? 

PARSED! 

Give the location of the port NORFOLK 


Sivewene momar CLuising speed for, 
maximum cruising speed for, 
position and the time at position 

for the ship WAINWRITE. 


USED 16 #HRS 276 MXSPD 34 HRS 130 
i200) 37Q0ONO1700E DATE 17 Jan 79 


from 


done 
EXIT 
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Demonstration of Interaction with LADDER. 
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